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ON THE PSYCHOPHYSICAL LAW* 


S. S. STEVENS 


Harvard University 


After dealing his critics their deserts, 
Fechner concluded his polemic of 1877 
with a defiant five-line Nachwort (10). 


The tower of Babel was never finished be- 
cause the workers could not reach an under- 
standing on how they should build it; my psy- 
chophysical edifice will stand because the work- 
ers will never agree on how to tear it down. 


For nearly a century the critics have 


pecked at the edifice in the disorganized - 


manner that inspired no fear in Fechner. 
They have contended that sensation is 
not measurable, and that it is; that 
Weber’s law is not true, and that it is; 
that the just noticeable difference is not 
a proper unit, and that it is; that the 
logarithmic law is not found in experi- 
ment, and that it is. As James put it, 
Fechner’s critics flailed about, “smiting 
his theories hip and thigh and leaving 
not a stick of them standing . Fs 
but here we are a hundred years later 
still discussing Fechner. 

The lesson of history is that a bold 
and plausible theory that fills a scien- 
tific need is seldom broken by the im- 
pact of contrary facts and arguments. 
Only with an alternative theory can we 
hope to displace a defective one. 

The purpose here is to try to do just 


1 This research was supported partly by a 
grant from the National Science Foundation, 
partly by Contract Nonr-1866 (15) with the 
Office of Naval Research (Project Nr142-201, 
Report PNR-188). Reproduction for any 
purpose of the U. S. Government is permitted. 
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that—to try to show that there is a gen- 
eral psychophysical law relating subjec- 
tive magnitude to stimulus magnitude, 
and that this law is simply that equal 
stimulus ratios produce equal subjective 
ratios. On numerous perceptual con- 
tinua, direct assessments of subjective 
magnitude seem to bear an orderly re- 
lation to the magnitude of the stimu- 
lus. To a fair first-order approximation, 
the ratio scales constructed by “direct” 
methods (as opposed to the indirect pro- 
cedures of Fechner) are related to the 
stimulus by a power function of one de- 
gree or another. Evidence for this fact 
is piling up under the impact of ratio- 
scaling procedures whose development 
over the past three decades has given a 
“new look” to psychophysics. The fruits 
of these new pursuits have implications 
and consequences for the traditional 
issues in the area of psychophysics, as 
well as for certain related issues con- 
cerned with category judgments, time- 
order errors, and scaling theory. 

That the Fechnerian philosophy of 
indirect measurement will fade quietly 
away as soon as it has been shown what 
direct methods can achieve is scarcely 
to be counted on, for elaborations and 
applications of this philosophy pervade 
important segments of the psychologi- 
cal activity. Curiously enough, modern 
Fechnerians are found less frequently 
among the psychophysicists than among 
the psychometricians and scale construc- 
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tors. It is they who sometimes assume 
that “equally often noticed differences 
are equal.” As elaborated in Thur- 
stone’s “law of comparative judgment,” 
this principle is essentially Fechner’s 
principle: namely, that the unit of meas- 
urement is given by resolving power. If 
not explicitly, at least by implication, 
this philosophy of indirect measurement 
asserts that all we can know about mag- 
nitude is what confusion tells us. Vari- 
ability becomes the measure of things, 
and the mean is meaningless. But on 
an important class of those simpler psy- 
chological continua where these notions 
are testable, we can show that equally 
often noticed differences are not equal, 
and that a scale proportional to psycho- 
logical magnitude is not achieved by 
procedures that try to transform vari- 
abilities, discriminal dispersions, or con- 
fusions into units of measure. 

Before we consider this point further 
we will need to draw a distinction be- 
tween two classes of continua. 


Two C1asses oF CONTINUA 


Perceptual continua divide themselves 


into two general classes (64). The 
nature of this division is suggested in 
a general way by the traditional di- 
chotomy between quantity and quality. 
Continua having to do with how much 
belong to what we have called Class I, 
or prothetic; continua having to do with 
what kind and where (position) belong 
to Class II, or metathetic. Class I 
seems to include, among other things, 
those continua on which discrimination 
is mediated by an additive or prothetic 
process at the physiological level (53, 
67). An example is loudness, where we 
progress along the continuum by add- 
ing excitation to excitation. Class II 
includes continua on which discrimina- 
tion is mediated by a physiological proc- 
ess that is substitutive, or metathetic. 
An example is pitch, where we progress 
along the continuum by substituting ex- 
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citation for excitation, i.e., by changing 
the locus of excitation. 

We know too little about physiologi- 
cal mechanisms to say whether all Class 
I continua are based on additive proc- 
esses or all Class II continua on substi- 
tutive processes, but in those instances 
where the facts seem clear the parallels 
between function and physiology are at 
least suggestive. Until our knowledge 
stands improved, it is perhaps best to 
classify the perceptual continua by the 
pragmatic criteria of the way they be- 
have and to hope that any uniformities 
we can discover will lead to deeper in- 
sights into basic mechanisms. 

Four functional criteria are relevant 
to the distinction between prothetic and 
metathetic continua, although they have 
not all been tested with equal thorough- 
ness. These four criteria concern the 
subjective size of the j.n.d., the form 
of category rating-scales, the time-order 
error, and hysteresis. It is likely, of 
course, that other criteria will be dis- 
covered if we continue to look for them. 

1. Subjective size of the jn.d. I list 
this criterion first because it is the 
growth in the subjective size of the j.n.d. 
as we go up the scale on a Class I 
or prothetic continuum that seems, in a 
sense, to “explain” at least two of the 
other three criteria. Fechner proposed 
that scales can be constructed by count- 
ing off just noticeable differences. The 
implication is that the sensation pro- 
duced by a stimulus 50 j.n.d.’s above 
threshold is half as great as that pro- 
duced by a stimulus 100 j.n.d.’s above 
threshold. The hard fact of the matter 
is that if the typical subject were con- 
fronted with two such stimuli on a Class 
I continuum he would assert with cer- 
tainty that the ratio between the two 
sensations is greater than two, because 
scales obtained by summating j.n.d.’s 
are nonlinearly related to ratio scales of 
subjective magnitude (33, 52). But 
more about this in a later section. The 
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point here is that on Class I continua 
the j.n.d.’s are not equal in subjective 
size. 

On continua of Class II (metathetic) 
the j.n.d.’s turn out to be approximately 
equal in subjective size when measured 
by magnitude scales of the continuum. 
The linearity between the mel scale of 
subjective pitch and the j.n.d. scale for 
frequency is a case in point (56) and 
it seems reasonable to suppose that the 
same linear relation is approximated on 
such continua as position and inclina- 
tion. 

2. Category rating-scales. A _ cate- 
gory rating-scale is the function ob- 
tained when a subject judges a set of 
stimuli in terms of a set of categories 
labeled either by numbers or by adjec- 
tives. The form of these scales is dif- 


ferent on the two kinds of continua. As 
shown by studies of a dozen perceptual 
dimensions (64), the category scales on 
continua of Class I are concave down- 
ward when plotted against a ratio scale 


of the subjective magnitude. Category 
scales on continua of Class II may be 
linear when so plotted. 

The chief factor that produces non- 
linearity in the category scales of Class I 
is variation in the subject’s sensitivity to 
differences. Near the lower end of the 
scale where discrimination is good the 
categories tend to be narrow, and by 
consequence the slope of the function is 
steep. Near the upper end, where a 
given stimulus difference is less easy to 
detect, the categories broaden and the 
slope declines. Only on Class II con- 
tinua, where sensitivity (measured in 
subjective units) remains relatively con- 
stant, is it ordinarily possible to produce 
category scales that are linearly related 
to subjective magnitude. 

Prothetic continua of Class I on which 
category scales have proved noniinear 
include apparent length, area, numer- 
ousness, duration, heaviness, lightness, 
brightness and loudness. Metathetic 
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DURATION IN SECONDS 


Fic. 1. J.n.d. scale, category scale, and ratio 
scale for apparent duration. Triangles: Mean 
judgments of 12 subjects who estimated the 
apparent duration of a white noise. Stimuli 
were presented in a different irregular order 
to each subject. Circles: Mean category judg- 
ments made by 16 subjects on a scale from 
1 to 7. The end stimuli were presented at the 
outset to indicate the range, and each subject 
judged each duration twice in a random or- 
der. Stimulus spacing was adjusted to pro- 
duce a “pure” category scale. Dashed line: 
Summated j.n.d.’s, right-hand ordinate. 


continua of Class II on which category 
scales have proved more or less linear 
include visual position, inclination, pro- 
portion, and pitch. On both types of 
continua alterations in the form of the 
category scale may be produced by a 
variety of factors, including stimulus 
spacing (or relative frequency of pres- 
entation), landmarks, and differential 
familiarity. 

By means of an iterative series of 
experimeuis we can achieve a stimulus 
spacing that will neutralize the effects of 
spacing and produce a “pure” category 
scale, uncontaminated by the subject’s 
expectations regarding the frequency 
with which he should name the various 
categorivs. An example of an approxi- 
matica to a pure category scale is shown 
in Sig. 1 (circles). This experiment 
was a follow-up to the experiments on 
duretion reported earlier (64). Guided 
by the outcome of the earlier studies, I 
so spaced the stimuli that the 16 sub- 
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jects tended to name the various cate- 
gories equally often. The proof that a 
pure category scale has been approxi- 
mated lies in the fact that if we use the 
curve in Fig. 1 to determine what spac- 
ing of the stimuli will mark off equal 
distances along the ordinate, no signifi- 
cant change in the spacing is called for. 
(The technique of experimental itera- 
tion is a powerful device for neutraliz- 
ing certain kinds of bias. For other ex- 
amples of its use see [57, 64, 66].) 
The iterated experiments on duration 
culminating in that shown in Fig. 1 con- 
verge on a category scale that is typi- 
cal of those obtained on continua of 
Class I (prothetic). We note that it is 
concave downward, as expected, and 
that it contrasts sharply with the ratio 
scale (triangles) determined by asking 
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Fic. 2. Psychometric functions from 50 
subjects obtained with recorded tests for abil- 
ity to discriminate the pitch and loudness of 
bands of noise. Each test contained 110 test 
items consisting of a standard noise (2 sec- 
onds) followed immediately by a comparison 
noise (2 seconds). The band of noise used in 
the pitch test was shifted up or down the fre- 
quency scale. Note that there is a “time 
error” for loudness but none for pitch. 
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12 subjects to estimate directly the ap- 
parent duration of the stimulus. The 
upward curvature of the ratio scale for 
subjective duration agrees with the re- 
sults of independent experiments in 
which the method of fractionation was 
used to determine the duration that ap- 
peared to be half as long as a standard 
duration (19, 50). 

The curves in Fig. 1 tell us that on a 
prothetic continuum the typical subject 
is unable to equalize the intervals on his 
category scale, even when instructed to 
do so. He fails for the basic reason 
that his ability to tell one magnitude 
from another varies over the scale and 
affects the width of his categories. Since 
he can easily tell 0.5 sec. from 1.0 sec. 
he tends to put them in different cate- 
gories; since he can only with difficulty 


. tell 3.5 sec. from 4.0 sec. he tends to 


put them in the same category. Any 
procedure that tries directly or indi- 
rectly to get the subject to partition a 
prothetic continuum into equal inter- 
vals seems bound to fail in the general 
case. 

For purposes of comparison, the func- 
tion obtained by counting up j.n.d.’s is 
plotted in Fig. 1 as a dashed curve. 
This curve, based on Woodrow’s (73) 
report that the relative j.n.d. for dura- 
tion is of the order of 10 to 12.5 per 
cent, shows the form Fechner’s law pre- 
dicts for the scale of apparent duration. 

3. Time-order error. The so-called 
“time-error” discovered by Fechner has 
been pursued in theory and experiment 
for many decades (23). It refers to the 
fact that the second of two equal stimuli 
tends to be judged greater than first. 
We have reason to believe that a sys- 
tematic time-order error (as Guilford 
prefers to call it) is characteristic of 
judgments on Class I continua. On 
Class II continua we neither expect it 
nor do we generally find it. 

This hypothesis was first suggested to 
me by the troubles we met when we 
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tried to supply the armed forces with 
recorded tests designed to assess pitch 
and loudness discrimination for bands 
of noise (29). Karlin and I discovered 
that it was easy enough to record a 
pitch test whose scoring was straight- 
forward, but how were we to score a 
loudness test when it always showed a 
time-order error? Were we to try to 
explain to the military that, although 
the second stimulus may actually have 
been 0.1 db lower than the first, if the 
soldier called it lower they should mark 
him wrong? We tried to get rid of the 
error by making the transition from the 
standard level to the variable level in- 
stantaneous, but the error was still there. 
We finally solved the issue by avoiding 
it: we dropped the stimulus items whose 
proper scoring was made ambiguous by 
the time-order error. 

Figure 2 shows results obtained with 
these tests from groups of 50 subjects 
each. (In the course of this work sev- 


eral hundred subjects were tested.) The 
crossing points of the psychometric func- 
tions show that there is no time-order 
error for pitch, but that for loudness the 
error amounts to about 0.3 db. Similar 
results were obtained when we used the 


Seashore pure-tone tests. Postman (43) 
then took up the problem and confirmed 
our finding that there is typically a 
time-order error with loudness but not 
with pitch. 

This perhaps is not the place to try 
to “explain” the time-order error, but I 
would like to venture an opinion about 
it. First of all, it is important to note 
that the error is typically small—a frac- 
tion of a j.n.d—and hence is close to 
the limiting “noise level” of our meas- 
urements. It is therefore not surpris- 
ing that the published findings resound 
with the noise of disagreement. At- 
tempts to map the course of the effect 
as a function of time are particularly 
discordant, which is good reason for giv- 
ing up the name “time-error.” Its de- 
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pendence on time is probably fortui- 
tous. As a matter of fact, in an ex- 
periment still in progress (by G. M. 
Shickman) we have obtained the usual 
“time-error” when the subject lifts the 
standard and the comparison weight 
simultaneously, one with each hand. 
What the Fechnerian time-order error 
really seems to depend on is the same 
basic process that makes the category 
scales on Class I (prothetic) continua 
turn out to be concave downward, 
namely, the asymmetry of sensitivity 
the fact that discrimination is better to- 
ward the low than toward the high end 
of the range. The methods commonly 
used to measure the time-order error 
are essentially similar to those we use 
in category scaling. The subject places 
stimuli in two or more categories like 
heavier, equal, or lighter and the forces 
that determine his judgments are the 
same as in any category scaling pro- 
cedure. (When the method of adjust- 
ment is used with loudness a different 
type of constant error occurs: the vari- 
able tends to be set higher than the 
standard [44], but the sign of the error 
may reverse when the standard is very 
loud [61].) The relation between the 
category scale and the conventional 
time-order error is perhaps best illus- 
trated by means of Fig. 3, where we see 
three category scales for lifted weights 
covering different ranges of stimuli. 
(For a further discussion of these and 
other such curves, see [64].) The 
widest range (Cowdrick [7]) gives the 
usual category curve, concave down- 
ward, and we note that the middle 
weight in the series is assigned a value 
above the middle of the category scale. 
When the range is shortened (Guilford 
and Dingman [25]) the same general 
features are preserved. The shortest 
range is the one used by Fernberger 
(11) to determine a difference limen by 
the method of single stimuli. This 
curve, like the others, fails to pass 
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Fic. 3. Category scales for lifted weight. 


On the middle curve only every other point has 


been plotted. The end stimuli were designated 4 and 12 but the subject could use other cate- 
gories if desired. Note that the curves are all concave downward. The fact that they do not 


pass through the point 0 indicates a “time error.” 


through the point 0, which is the point 
defined by the middle of the stimulus 
range and the middle of the category 


scale. The distance between the point 
O and the point where the curves cross 
the horizontal line through the middle 
of the plot is the time-order error (cf. 
Pratt [46]). It is as simple as that. 
By means of the theory of category 
judgments we can apparently account 
for Fechner’s time-order error without 
the help of such props as sinking traces, 
fading images, or assimilation. And it 


becomes clear why we should expect to 
find time-order errors on Class I con- 
tinua (prothetic) but not on Class II 
(metathetic). 

One further point. Not only is the 
time-order error produced by other fac- 
tors than time, but it also has nothing 
to do with order. Nor is it necessarily 
an error. The name, like the one given 
to the Holy Roman Empire, is a mis- 
nomer in all particulars. If my thought 
is correct, that we are dealing here with 
an effect on category judgments that 
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derives from an asymmetry based on 
the relativity of discrimination, a better 
name for it might be the “category ef- 
fect.” Such a label might also help to 
distinguish this form of bias from other 
sources of systematic “error,” of which 
there are many. Not all constant de- 
viations in psychophysical experiments 
can be blamed on the category effect. 
To take a single example, the observa- 
tion of Fullerton and Cattell (13) that 
the second of two equal light flashes ap- 
pears fainter than the first is probably 
due to an altered state of retinal adapta- 
tion produced by the first flash. In this 
instance an adaptation effect can appar- 
ently override the category effect and 
produce what would conventionally be 
called a positive time error. No doubt 
there are still other sources of system- 
atic bias in experiments of this sort. 

It is true of course that time may 
play a role in these and other system- 
atic errors, in the sense that the effects 
may vary with the interval between 
judgments. But our problem is first to 
try to discover the basic nature of the 
asymmetries involved in the various 
kinds of constant errors and then to see 
how the forces that produce them may 
vary with time. 

4. Hysteresis. Since this word means 
a lagging behind, as when magnetization 
lags behind the magnetizing current, it 
seems a good term to describe what hap- 
pens when the apparent sense-distances 
between successive stimuli are judged in 
different orders. The effect shows up 
in especially dramatic fashion in experi- 
ments on bisection and equisection. In 
a typical experiment on loudness the 
subject sits before a row of five keys 
which he presses to produce the tones 
(1,000 cps). The levels produced by 
the two end keys are fixed, 40 db apart, 
and the subject adjusts the leve! 
trolled by the intermediate keys in or 
der to divide a 40-db interval into four 
equal-appearing steps in loudness. 

Where the subject sets the levels de- 
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Fic. 4. Hysteresis effects in judgments of 
intervals on three sensory continua. The ar- 
rows indicate the order of stimulus presenta- 
tion. For brightness and loudness the subject 
adjusted the middle stimuli to divide the range 
into two or four equal-appearing intervals. 
For heaviness the subject lifted three weights 
and indicated the relative apparent position of 
the middle weight by means of a slider be- 
tween two markers on a bar. Five different 
middle weights were used in ascending and 
descending order. 


pends on whether he listens to the loud- 
nesses in ascending or descending order. 
I found that in the ascending order he 
sets the bisecting level some 5 to 8 db 
higher than in the descending order. In 
an experiment run to verify this effect, 
Garner (15) found an average discrep- 
ancy of 5.8 db. It is as though the 
loudness the subject hears lags behind 
what he should hear as he goes up or 
down the scale. As a result of this lag 
the graphs of the functions shown in 
Fig. 4 exhibit a “hysteresis loop”: the 
ascending path is different from the de- 
scending path. Please note, however, 
that in calling this phenomenon hystere- 
sis I am trying to describe it, not ex- 
plain it. I am not sure I know how to 
explain it. 

As shown in Fig. 4 the same hysteretic 
effect occurs in the bisection of bright- 
ness intervals and in judgments of the 
intervals between lifted weights. Partly 
hecause the range was shorter (30 db) 
ihe brightness bisections show less hys- 
teresis than loudness. Lifted weights, 
on the other hand, show relatively more. 
The possibility suggests itself that the 
magnitude of the hysteresis may vary 
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inversely with differential sensitivity. In 
judging the apparent size of the two in- 
tervals between three weights, the sub- 
jects moved the middle of three pointers 
on a steel bar to indicate the relative 
position of the middle weight. Thus we 
see that hysteresis is independent of 
whether the subject himself adjusts the 
stimulus. The same bar and pointers 
were also used for judgments of loud- 
ness, and the resulting hysteresis was 
essentially similar to that shown in Fig. 
4. The data in Fig. 4 are typical ex- 
amples from an extensive series of un- 
published experiments, some of which 
were run by R. J. Herrnstein. 

Hysteresis occurs on the three exam- 
ples of Class I continua that we have 
tested. The hypothesis I am suggesting 
is that it may occur on all continua of 
this class, but that it probably does not 
occur on continua of Class II. Evidence 
for this possibility is suggestive, but not 
yet conclusive. 

Bisections on the continuum of visual 
position, for example, seem to be free of 
hysteresis. At least what minor effect 
has been found is in the opposite direc- 
tion from that observed on Class I con- 
tinua. Volkmann kindly undertook to 
test this matter on the large viewing 
screen (21.6 X 6.8 ft.) at Mt. Holyoke 
College (6). The subject sat 35 feet 
from the screen and adjusted the posi- 
tion of the middle of three points of 
light that came on in the sequences 
right-middle-left and left-middle-right. 
The end lights were a meter apart. 
When the 12 observers were instructed 
to fixate the center of the screen, the 
two orders produced an average differ- 
ence in the bisection point of only 0.25 
cm. When the instructions were to look 
at each stimulus as it came on, the dis- 
crepancy was 3.6 cm. Since both dis- 
crepancies were in the opposite direc- 
tion from those observed with loudness, 
brightness, and lifted weights, we may 
conclude that, whatever constant errors 
may occur on this Class II continuum, 
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they apparently constitute a different 
phenomenon from the Class I hysteresis. 

The question of hysteresis in pitch 
judgments is less easy to answer. We 
found no evidence for hysteresis in the 
experiment that led to the construction 
of the mel scale (67), but the fact is 
we were not looking for it. In a more 
direct test of the issue I later had 10 
subjects bisect various pitch intervals in 
ascending and descending order. The 
experiment was casual and not defini- 
tive, and the results were not clear cut. 
For some subjects hysteresis seemed to 
be present, for others not. Two subjects 
happened to have “absolute pitch’”—the 
best cases I happen to have encoun- 
tered. Each of these two subjects set 
the middle tone to the same frequency 
on every trial, regardless of the order of 
listening. So it seems that at least some 
people show no hysteresis in their pitch 
judgments. But this is apparently not 


true of others. 
In a recent laboratory exercise, two 


of my students, D. D. Greenwood and 
M. L. Israel, found hysteresis in the 
equisections of all but one of nine sub- 
jects who divided the interval 400—7,000 
cps into four equal-appearing intervals. 

Incidental evidence for the absence of 
hysteresis in pitch bisections is found in 
an experiment by Cohen, Hansel, and 
Sylvester (5). They were concerned 
with another problem, but part of their 
procedure called for the bisection of the 
pitch distances between 1,000 and 3,000 
cps and between 2,000 and 4,000 cps 
in both ascending and descending order. 
The bisections were as follows: 


For 1,000—3,000 cps, going up: 1,874 
cps; going down: 1,838 cps. 

For 2,000-4,000 cps, going up: 2,693 
cps; going down: 2,808 cps. 


The average standard deviation for all 
settings was 303 cps. 

We see that the interval 1,000—3,000 
cps shows an insignificant discrepancy 
(36 cps) and that the larger discrepancy 
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(115 cps) shown by the interval 2,000- 
4,000 cps is less than half the standard 
deviation. It is also important to note 
that the directions of the discrepancies 
between the results for ascending and 
descending orders are opposite in the 
two parts of the experiment. Inciden- 
tally, the points of bisection that are 
predicted by the mel scale are 1,859 cps 
and 2,280 cps, which are within 3 per 
cent of the observed values. 

All in ali then, the evidence for hys- 
teres'* in pitch bisections is ambiguous. 
The pitch continuum probably means 
different things to different people, es- 
pecially to those with and without “ab- 
solute pitch.” It is in many ways a 
difficult continuum about which to gen- 
eralize (see, for example [64]). 

Another point concerning bisection de- 
serves mention. On continua of Class I 
(prothetic) the point of bisection falls 
consistently below the point predicted 
by the ratio scale of the subjective mag- 
nitude (39, 58, 63). This is true of the 
combined results from which hysteresis 
has been eliminated by an averaging of 
the ascending and descending judgments. 
It is also clearly evident in the early ex- 
periments by Fullerton and Cattell (13) 
who explored the “force of movement” 
by the methods of halving, doubling, 
and bisection. This error in bisection 
is scarcely surprising when we consider 
that bisection and category rating-scales 
have much in common. In both in- 
stances the subject tries to equalize in- 
tervals, and in both instances he misses 
the mark in the same direction—pre- 
sumably because both kinds of judg- 
ments are controlled partly by discrimi- 
nation, and discrimination varies from 
one end of the continuum to the other. 

On continua of Class II (metathetic), 
on the other hand, the results of bisec- 
tion agree with those obtained by frac- 
tionation and by magnitude estimation. 
This has been demonstrated for pitch 
(64, 67) as well as for inclination or 
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“bearing” (47, 64). Apparently it is 
only on Class II continua that the con- 
struction of a scale of subjective mag- 
nitude by the method of bisection leads 
to a valid outcome, for only there can 
bisection be shown to agree with direct 
magnitude estimation. 

In our experiments on weight and 
loudness: in which the subject moved a 
pointer slong a bar, the parallel with 
categ »y rating-scales is even more ob- 
vious. In effect, the bar and pointer 
constitute a continuous rating scale, 
analogous to the widely used scales on 
which «he rater marks a point on a line 
with a pencil. The bar and pointer also 
give results (hysteresis averaged out) 
that differ from those predicted by the 
ratio scale of the Ciass I continuum. 

We see then that possibly four func- 
tional criteria can be used to divide the 
perceptual continua into two general 
classes. Clear examples of each of these 
classes are easy to find, but this is not 


to say that all examples one might think 
of will prove easy to classify. When we 
try to impose a simple order on the com- 
plex fabric of our experience there is an 
ever-present risk that we will oversim- 


plify what is not simple. Class I (pro- 
thetic) constitutes a rather unitary and 
well behaved group of continua, but 
Class II (metathetic) includes various 
disparate aspects of things, many of 
which do not fall naturally onto unitary 
monotonic scales, and some of which 
may not even be orderable. Some con- 
tinua may turn out to be hybrid mix- 
tures of both Class I and II. Pitch, in 
fact, may be one of these (64). 

In most of what follows we will limit 
our concern to the prothetic continua 
(Class I), the “quantitative” aspects of 
things, for it is here that we can most 
profitably examine the principles that 
psychophysics has set itself to devise. 
The first of these principles concerns the 
relation between stimulus and sensation. 
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THE Power FUNCTION 


There is a growing bundle of evidence 
to indicate that on prothetic continua 
the form of the “psychophysical law” 
is a power function. Most of this evi- 
dence, accumulated since 1930, is the 
product of a reoriented psychophysics 
that brings “direct” methods to bear on 
the Fechnerian query regarding the re- 
lation between stimulus and sensation. 
These direct methods lead to ratio scales 
of perceptual magnitude, and to a first 
approximation these scales show that 
the psychological magnitude is a power 
function of the stimulus magnitude. 

The basic principle seems to be that 
equal stimulus ratios tend to produce 
equal sensation ratios. Empirical evi- 
dence to support this principle has been 
obtained for at least a dozen perceptual 
continua, with the aid of several differ- 
ent experimental methods. What the 
principle entails can be illustrated by 
examples such as these: In order to 
make one sound seem half as loud as 
another, the physical energy must be 
reduced by about 90 per cent (10 db), 
and this required reduction is approxi- 
mately the same regardless of the level 
from which we start. The same is true 
of brightness, and the required reduc- 
tion is of the same order of magnitude 
as for loudness. In order to make one 
lifted weight seem half as heavy as an- 
other, the original weight must be re- 
duced by about 38 per cent, and this 
percentage reduction is approximately 
constant over a wide range of stimuli. 
The approximate constancy of the per- 
centage reduction corresponding to a 
given subjective ratio demonstrates that 
the sensation y is proportional to the 
stimulus S raised to a power n. Thus 


wy = RS". 


When we convert this equation to loga- 
rithmic form we obtain a linear equa- 
tion which has a certain practical use- 
fulness, for the function can then be 
represented by a straight line in log-log 


coordinates. The slope of this line cor- 
responds to the exponent n. 

When we know the value of ratio r 
between the stimulus judged half and 
the standard stimulus we can obtain the 
value of m by dividing log r into log 
0.5. In general, if s is the sensation 
ratio (determined by the method of 
ratio production) and r is the corre- 
sponding stimulus ratio, then 


_ logs 


~ logr 


It is not to be expected, of course, 
that a simple power function will hold 
from zero to infinity. What the limits 
may be must be decided by experiment. 
We know that some functions relating 
stimulus to response exhibit discontinui- 
ties (e.g., apparent numerousness and 
flash rate), and we know that our for- 
mulas might need eventually to take ac- 
count of second-order perturbations in 
the vicinity of the absolute threshold 
(cf. 8, 58). But our present concern is 
with first-order approximations to gen- 
eral laws. 


METHODS FOR RATIO SCALING 


Before we consider the concrete in- 
stances of ratio scales, let us look briefly 
at the problem of procedure. Methods 
for constructing ratio scales of subjec- 
tive magnitude are a relatively new de- 
velopment. It is true that Merkel (31) 
with his Methode der doppelten Reize 
(1888) tried to find the stimulus that 
appeared to double the sensation, but 
Merkel’s effort seems to have had little 
influence on psychophysics. Fullerton 
and Cattell (13), who used the meth- 
ods of doubling and halving, fared little 
better. Even Titchener (72) in his thor- 
ough and scholarly Instructor’s Manual 
makes only off-hand allusion to Merkel’s 
method of “doubled stimuli.” And of 
the author of the method he says “we 
are reminded that he is, by mental con- 
stitution, a physicist rather than a psy- 
chologist . . . and that his work as a 
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whole is exceedingly poor in introspec- 
tive data” (p. 223). It may be that 
“mental constitution” plays a role in 
these things, as Titchener suggests. I 
wonder what he would make of the fact 
that at least seven different physical 
laboratories have made important con- 
tributions to the development of the 
ratio scale of subjective loudness—the 
sone scale—as against perhaps three 
different psychological laboratories. 
The impetus for the development of 
a method is a problem. Method for its 
own sake, tempting and fascinating as 
it may be, often leads to littl—except 
methodology. Physicists and psycholo- 
gists went to work to refine the pro- 
cedures for measuring loudness mainly 
because the substantive outcome was of 
interest and importance to someone, par- 
ticularly to acoustical engineers. This 
is shown by the fact that some of the 
earliest studies were paid for by com- 
mercial companies. Curiously enough 
the practical problem originated in an 
obvious failure of Fechner’s law. Not 
long after they had adopted the decibel 
scale for measuring sound intensities, 
the engineers noted that equal steps on 
the logarithmic (decibel) scale do not 
behave like equal steps, for a level 50 
db above threshold does not sound at 
all like half of 100 db, as Fechner’s law 
implies it should. Since the acoustical 
engineer must often interpret to his cus- 
tomers the meaning of esoteric acousti- 
cal measurements, it soon became clear 
that a scale was needed on which the 
numbers would be proportional to how 
loud things sound to the typical listener. 
Without the motive of this kind of prac- 
tical problem to spark the development 
of method, it is a fair guess that the 
ratio scales for the fourteen perceptual 
continua we will examine might never 
have been constructed. How a subjec- 
tive ratio scale, once achieved, can be 
put to work is shown by the way we 
were able to use the sone scale in the 
development of a procedure for calcu- 
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lating the loudness of a complex noise 
from a spectral analysis of the sound 
(61). 

The methods for constructing ratio 
scales are still in the process of develop- 
ment, but in one form or another they 
all require subjects to make quantitative 
estimates of subjective events. Many 
authors have screamed that this is non- 
sensical, meaningless, and impossible 
(12, 34), but those who follow these 
methods go ahead and do it anyhow. 
These direct assessments of sensation 
seem not so impossible after they have 
been made. 

At present the methods are princi- 
pally four, but each has subvarieties. 
We may classify the methods more or 
less systematically as follows: 


1. Ratio estimation 
a. Direct judgment of ratios 
b. “Constant-sum” 
. Ratio production 
a. Fractionation 
b. Multiplication 
. Magnitude estimation 
a. Prescribed modulus 
b. No designated modulus 


4. Magnitude production 


I am well aware that few activities in 
psychology exhibit greater terminologi- 
cal confusion than the naming of meth- 
ods, and that there is probably no effec- 
tive cure for the burgeoning, ramifying 
process that occasions the welter of 
names. The best we can do is take 
stock from time to time and attempt 
a little systemization. Having formerly 
contributed my full share to the con- 
fusion, in the foregoing list I have tried 
for a dose of order. 

Partly through my efforts the second 
class of procedures, which was the first 
used historically, got the name frac- 
tionation (68), because the usual pro- 
cedure has been to require a subject to 
set one stimulus to produce a sensation 
half as great as a given standard. Frac- 
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Fic. 5. Results of adjusting a loudness ratio 

to equal an apparent brightness ratio. The 

subject viewed a pair of luminous circles of 
which one was dimmed by an amount shown 
on the abscissa. He also pressed a pair of 
keys to produce two different levels of white 
noise. He adjusted one level (ordinate) to 
make the loudness ratio seem equal to the 
brightness ratio. The brighter light was about 

99 db re 10°” lambert and the louder noise 

was about 92 db re 0.0002 microbar. 





tions other than a half are also used 
with consistent results. But fractiona- 
tion in this sense is only one facet of 
a more general procedure. The other 
facet, which we might call multiplica- 
tion, involves the complementary process 
of requiring the subject to identify or 
produce a prescribed ratio that is greater 
than one, e.g., to double, triple, etc., a 
given standard. This procedure has not 
been used very much, probably not as 
much as it should be, for there is abun- 
dant evidence (62) to indicate that, 
when it is made to complement halving, 
the procedure of doubling can help to 
balance out certain systematic biases. 
The two procedures together may be 
called ratio production. The production 
of ratios can be carried out by a variety 
of different techniques. Thus the ex- 
perimenter may allow the subject to ad- 
just a stimulus to produce a prescribed 
ratio to a standard, or the experimenter 
may set the stimulus and ask the sub- 


ject whether it meets the prescribed 
ratio (method of “constant stimuli’). 

An interesting variant of ratio produc- 
tion involves the fixing of two lumi- 
nances to define an apparent ratio which 
the subject is to reproduce by setting 
two loudnesses to the same apparent 
ratio. In an experiment by J. C. Stevens 
in this laboratory the subjects set nearly 
the same physical ratio (decibels) be- 
tween the intensities of the white noises 
as the experimenter set between the in- 
tensities of the two white surfaces. Fig- 
ure 5 shows the median settings made 
by 15 subjects, each of whom twice ad- 
justed the loudness of the second of two 
noises until the apparent ratio between 
the noises equaled the apparent ratio 
between two luminous targets viewed 
against a dark surround. The subject 
sat about 4 feet from the targets, which 
were 4.4 cm. in diameter and separated 
by about 17.5 cm. The fact that the 
median settings of the variable noise 
fall fairly close to the line indicates that 
the subjective impressions produced by 
white light and white noise are simi- 
lar functions of stimulus intensity. The 
adjustments made to the largest ratio 
(40 db) were of course quite variable: 
the semi-interquartile range was about 
10 db.? 

The method of ratio estimation is the 
inverse of ratio production. Instead of 
prescribing the ratio in advance, the 
experimenter presents two (or more) 
stimuli and asks the subject to name 
the ratio between them. The subject 
may name the ratio directly, as in the 
pioneer experiment of Richardson and 
Ross (49), or he may be constrained to 
express the ratio by dividing a given 
number of points between the two 
stimuli in the manner proposed by Met- 
fessel (32). The constraint involved in 
the so-called “constant-sum” method has 


2For the rationale of measuring luminance 
in decibels see (59). 
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obvious disadvantages when large ratios 
are involved. 

The method of magnitude estimation 
(60) dispenses with ratios as such and 
requires the subject to assign numbers 
to a series of stimuli under the instruc- 
tion to make the numbers proportional 
to the apparent magnitudes of the 
sensations produced. The experimenter 
may prescribe a modulus by presenting 
a given stimulus and calling it some par- 
ticular value, e.g., 10, or he may leave 
the subject free to pick his own modu- 
lus. (Note: if the magnitude estima- 
tions give a skewed distribution, as they 
usually do, it is advisable to caiculate 
medians instead of means.) 

The method of magnitude estimation 
has a logical inverse in magnitude pro- 
duction—a possible method that has 
thus far been mostly ignored. Instead 


of presenting a series of stimuli in ir- 
regular order and asking the subject to 
judge their apparent magnitudes, the ex- 
perimenter might name various magni- 


tudes and ask the subject to adjust 
stimuli to produce proportionate subjec- 
tive values. Like any given procedure, 
this method probably has its “bugs” and 
biases and it might be interesting to. 
find out what some of them amount to. 

One thing we know is that in using 
this method the experimenter must re- 
sist whatever impulse he may have to 
fix or designate the top and bottom of 
the range, for this will change the prob- 
lem into one involving category scaling. 
In some earlier experiments (64) we 
used this procedure, which we may call 
category production, to generate a 7- 
point category scale of loudness. We 
presented two levels which we desig- 
nated 1 and 7 and then asked subjects 
to produce each of the other categories 
in irregular order. The results are like 
the usual category judgments obtained 
on Class I continua: the function is 
concave downward when plotted against 
the sone scale. 
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Magnitude production of a sort has 
been used in an experiment in which we 
used a brightness rather than a number 
to designate a magnitude. We set the 
luminance of a single target to various 
levels and asked subjects to adjust a 
noise to make its loudness appear as 
great as the brightness of the light. Al- 
though this investigation is still in prog- 
ress (by J. C. Stevens) the results ap- 
pear to be fairly consistent with what 
we know about the subjective scales for 
loudness and brightness. The intensities 
of the white noise are made roughly pro- 
portional to the intensities of the white 
light, again suggesting that loudness and 
brightness are similar functions of in- 
tensity. 

All four of the methods listed above 
provide the kind of data necessary for 
the construction of a ratio scale (55). 
Each method can of course be varied 
and modified in numerous ways. Not 
only do the methods need to be altered 
and adapted to fit the problem at hand, 
but in any serious effort to establish a 
definitive scale for a given perceptual 
continuum it is desirable to ferret out 
the possible sources of bias by using dif- 
ferent methods and by altering relevant 
parameters. In the present state of the 
art a valid scale that is representative 
of the typical subject can scarcely be 
hoped for from a single try. 

The following ratio scales of subjec- 
tive magnitude have been erected by 
means of one or more of the general 
procedures named above. They have 
not all been given the intensive study 
and cross-checking they deserve, but our 
interest here is in their general form 
rather than in their detailed adequacy. 
To a first approximation they are all 
power functions. 


FouRTEEN RATIO SCALES 


Our review of the fourteen examples 
of ratio scales of subjective magnitude 
will be brief, because most of the details 
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PERCEPTUAL CONTINUA ON WHICH PsYCHOLOGICAL MAGNITUDE 
1s A POWER FUNCTION OF THE STIMULUS 


The second column shows the approximate exponent of the power function. Names suggested for 
the various subjective units are listed in Column 3. The methods used (Column 4) are indicated 
by numbers that refer to the methods listed elsewhere in this paper. 
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Visual length , mak 
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Lightness of grays 
Finger span 
Numerousness 
Heaviness 
Auditory flutter 
Visual velocity 
Visual flash rate 
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*S. S. Stevens and Geraldine Stone. Research in progress. 


are available elsewhere. We can help 
the cause of brevity by means of Table 1 
where the significant facts are listed and 
the sources of more detailed information 
are tabulated. In Table 1 the continua 
are ordered by increasing size of the ex- 
ponent of the power function, and when- 
ever a name for the subjective unit of 
the ratio scale has been suggested it is 
recorded. The methods that have been 
applied to each scale are indicated by 
numbers that refer to the listing in the 
previous section. I think all these con- 
tinua belong to Class I, but this of 
course may be open to question. 

Since Table 1 was prepared, an ex- 
periment on smell has been started in 
this laboratory by T. S. Reese. The 
early results suggest that the apparent 
intensity of the odor of benzaldehyde 
(synthetic almond) is a power function 
of the concentration in moles per liter 
of air. The exponent seems to be of the 
order of 0.2. 

It goes without saying that all these 
scales have not been determined with 


la, 2a, 2b, 3a, 3b, 4 (58) 

2a, 2b, 3a, 3b | (28), (64) 

2a (17) 

2a | (1) 

la, 2a, 3a (8), (47), (64) 
la, 1b, 2a (8), (47) 

2a, 3a (19), (S50), 64) 
3a, 3b (64 

3a, 3b ° 





numer | 2a (69) 


la, 1b, 2a, 2b, 3a, 3b| (64) 
2a, 2b | (42) 
2a (9) 

2a (48) 








equal care and thoroughness. About 
some of them we know a lot and have 
reason for confidence in their form; 
others have received only limited at- 
tention. Some of these attributes are 
easy to scale; others are more resistant 
to precision. Some of them represent 
perceptions about which we frequently 
make judgments of a more or less quan- 
titative sort and about which the sub- 
jects in an experiment know, or think 
they know, so much about the stimulus 
that the “stimulus error” (3) may be 
a ready danger; others involve stimuli 
whose physical parameters are so un- 
familiar to the typical subject that his 
temptation to try to estimate a physical 
measure of the stimulus, rather than to 
judge how it appears, is remote and un- 
compelling. Yet all these ratio scales 
approximate power functions. 

I say approximate because it is cer- 
tain that not all judgmental continua 
are strictly power functions of an arbi- 
trary physical parameter of the stimu- 
lus. We know, for example, that over 
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most of the range the loudnesses of a 
1,000-cycle tone and a white noise grow 
with intensity according to similar laws, 
but at very low intensities the loudness 
of the white noise grows more rapidly. 
Hence the loudnesses of noise and tone 
cannot both follow power functions 
throughout their entire range, and it is 
even doubtful that either is strictly a 
power function at better than a first- 
order approximation (62). 

On this question of the form of a psy- 
chophysical law we ought perhaps to try 
to learn a lesson from Fechner’s mistake 
and forego the temptation to insist on 
the exactitude of a particular formula. 
In the realm of human judgment we 
may aspire to the discovery of first-order 
invariances, but it would be idle to pre- 
tend that precision can be pushed to the 
level envisaged by Fechner. The judg- 
ment of subjective magnitude is inher- 
ently a “noisy” phenomenon. When 
people try to describe a sensation in 


quantitative terms they face a difficult 
task, and the factors that affect the 


outcome are numerous and subtle. Pa- 
tience and experimental skill can prob- 
ably clean up part of the variance, but 
there will always remain irreducible dis- 
persions to set a level below which we 
sink into uncertainty. In the broader 
perspective of things the power law 
stands out as a first-order relation, just 
as Weber’s law describes the first-order 
relation between magnitude and resolv- 
ing power. Although we are sometimes 
more fascinated by second-order depar- 
tures from first-order relations than we 
are by first-order relations themselves, 
the first-order generalities have the 
broader significance. It is of greater 
moment to know the first-order law for 
the velocity of falling bodies than it is 
to prove that a host of variables can 
produce departures from the law. 
Brief comments on some of the scales 
listed in Table 1 are now in order. We 
have already discussed loudness in vari- 
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Fic. 6. Median magnitude estimations for 
loudness and brightness. For loudness each of 
32 subjects made two estimates of each level 
(1,000 cps) in irregular order with no desig- 
nated standard. Estimates were transformed 
to a common modulus at the 80-db level. For 
brightness each of 28 dark-adapted subjects 
made two estimates of each level in irregular 
order. The target subtended about 5° and 
was illuminated for about 3 sec. at each pres- 
entation. Once at the beginning of each ses- 
sion the subject was shown the level 70 db (14 
subjects) or 80 db (14 subjects) and told to 
call that level 10. Estimates for all subjects 
were transformed to a common modulus at 
70 db. 


ous connections, and a review of most 
of the work on the problem is available 
(58). An example of a segment of the 
loudness function obtained from 32 sub- 
jects by the method of magnitude esti- 
mation is shown in Fig. 6. It is inter- 
esting that the sone scale is, so far as I 
know, the only subjective ratio scale for 
which the American Standards Associa- 
tion has appointed a committee to rec- 
ommend a standard form. 

Brightness is still under active study. 
For ordinary luminous targets seen 
against a dark surround by a dark- 
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adapted observer, brightness seems to 
grow approximately as the 0.3 power of 
the luminance. Figure 6 shows mag- 
nitude estimations for brightness ob- 
tained from 28 subjects in an experi- 
ment being conducted by J. C. Stevens. 
For very small targets that approxi- 
mate point sources the exponent is 
larger than that determined by the line 
in Fig. 6, and of course the state of 
adaptation of the eye and the presence 
or absence of contrast makes a differ- 
ence to the brightness function. These 
two factors increase the size of the ex- 
ponent, as can be seen in Table 1 where 
the exponent for lightness of gray pa- 
pers is listed as 1.2. When grays of dif- 
ferent reflectances are viewed under ordi- 
nary illumination, light adaptation and 
contrast both operate to increase the 
steepness of the brightness function to 
a different order of magnitude from that 
observed with luminous targets seen in 
the dark. 


Visual distance refers here to the ap- 
parent distance from the observer to a 
marker on a 200-foot line that stretched 
away from the observer on a large flat 


lawn. Two subjects made fractionation 
judgments. The combined results can 
be fitted by a % power law as well as, if 
not better than, by the formula pro- 
posed by Gilinsky (17). Specifically, 
the data in her Fig. 6 fall close to a 
straight line. The main difficulty with 
the fit occurs at the shorter distances, 
where for one observer the apparent half 
values are nearly equal to the objective 
half values. But we would expect ap- 
parent distance to approximate objective 
distance when the angle between the 
line on the grass and the observer’s line 
of regard becomes large. Perhaps if the 
line went out from the bridge of the ob- 
server’s nose we would obtain a power 
function all the way—with an exponent 
less than one. Of course apparent dis- 
tance, like apparent size, is specific to 
the experimental situation and is af- 
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fected by cues of various sorts. It is 
not to be expected, therefore, that a 
single power function will govern all 
judgments of distance. As a matter of 
fact, in a very different set-up involving 
short distances (4.5 meters and less) 
Gruber (21) obtained data that fit a 
power function with an exponent of ap- 
proximately 1.3. 

Taste, curiously enough, gives an ex- 
ponent of approximately unity when the 
stimulus measure is the ratio of solute 
to solvent (1). The functions reported 
for four different taste substances vary 
somewhat from one to another, for taste 
is a troublesome modality to work with, 
but they all give fairly straight lines in 
a log-log plot and the slopes hover 
around the value one. Another experi- 
ment (on sweetness [30]) gave only a 
rather poor approximation to a power 
function. 

Length and duration give subjective 
ratio scales that are nearly proportional 
to the physical scales on which we usu- 
ally measure the stimulus. On balance, 
however, there is a tendency in both 
cases for the exponent of the power 
function to be greater than one, which 
means that on these continua the mag- 
nitude of the subjective impression is 
a slightly accelerating function of the 
stimulus magnitude. The accelerating 
function for duration is shown in Fig. 1. 

Apparent area seems to be nearly pro- 
portional to physical area, but the shape 
and size of the area to be judged seem 
to produce second-order differences. For 
smaller squares the exponent appears to 
be greater than for larger squares. In 
a recent experiment with circles the ex- 
ponent turned out to be less than one 
(8). 

For continua such as length it is 
sometimes held that learning plays a 
major role: we learn what length meas- 
ures half as long as another, and so it 
comes to look half as long—which is one 
way of explaining why the exponent of 
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the power function is near unity. This 
may or may not be true. Of course, if 
by learning we mean that the organism 
has had a history, we can never hope to 
experiment on subjects who have not 
learned. How the organism got to be 
the way it is will always be an engaging 
problem, but it is also a worthy object 
of attention to discover just how the 
typical subject now behaves, regardless 
of how he got that way. 

The “learning explanation” of these 
ratio scales seems much less convinc- 
ing for such continua as loudness and 
brightness. Despite Ebbinghaus’ argu- 
ment (see [60]) that we learn the mean- 
ing of a doubled brightness by illumi- 
nating the scene with two lights instead 
of one, the typical viewer actually re- 
quires some nine or ten times the illumi- 
nation before he reports the brightness 
doubled. Why is this? And why is it 
that when the reflectance of gray pa- 
pers is involved, where it is scarcely con- 
ceivable that the typical viewer knows 
the nature of the physical processes at 
work in the stimulus, the exponent is 
greater instead of much less than unity? 
Sophistication as regards the stimulus 
seems less than a likely explanation. 

It is true, however, that one can learn 
to estimate the physical magnitude of a 
stimulus, as a good photographer well 
knows. He corrects what he perceives 
by what he knows about light and its 
relation to subjective impressions. By 
a similar procedure the acoustical engi- 
neer may estimate decibel levels with 
fair precision. He learns what a given 
subjective impression means in terms of 
the indications on a level meter. I have 
tested at least a half dozen acoustical 
engineers whose business involves the 
estimation and measurement of decibel 
levels. Despite their familiarity with 
decibels their subjective scales of loud- 
ness are not systematically different 
from those of other listeners. They can 
tell me, for example, both that one 


169 


sound seems about half as loud as an- 
other, and also that the one is about 
90 db and the other about 100 db. 
Let us return now to Table 1. The 
ratio scale for finger span was obtained 
as part of a program that is still in 
progress. Subjects judged the apparent 
thickness of blocks of wood placed for 
about a second between the thumb and 
middle finger. The magnitude estima- 
tions obtained show that over a range of 
from about 0.2 to 6.0 cm the impres- 
sion of felt thickness grows as a power 
function of the thickness of the blocks, 
and that the exponent is about 1.3. 
Numerousness refers to the impression 
one gets from looking at a collection 
of dots that contains too many to be 
counted. The exponent of about 1.34 
was derived from Taves’ experiment in 
which he used the method of fractiona- 
tion. I computed the exponent from 
the mean fraction judged half for all 
standards above 8 dots. The exponent 
is equal to log 0.5 divided by the loga- 
rithm of the fraction judged half. 
Lifted weights have been scaled by 
several methods in several laboratories 
and most experimenters have found close 
approximations to power functions. The 
value of 1.45 for the exponent of the 
veg scale was obtained from pooled 
results from five different laboratories. 
For the procedures involved, see (64). 
It is interesting to note that when Ful- 
lerton and Cattell in 1892 had their 
subjects try to halve and double the ap- 
parent force exerted on a dynamometer 
they came quite close to the values that 
would be prdicted by the veg scale. 
The two values obtained fell on oppo- 
site sides of the predicted value, and 
the largest discrepancy was about 10 
per cent of the predicted value. Al- 
though these data are meager they sug- 
gest that the apparent force of this type 
of muscular pull may follow a power 
function that is like the veg scale. 
Auditory flutter refers to the appar- 





170 


ent rate at which a white noise is turned 
on and off. Over a wide range of flut- 
ter frequencies (5 to 200 per second) 
the apparent rate of flutter grows as 
the 1.7 power of the frequency of stimu- 
lus interruption. Pollack used both frac- 
tionation and multiplication in deter- 
mining this function. The one value he 
tested at a rate below 5 per second sug- 
gests that there may be a discontinuity 
in the function at about this point. 
Otherwise the data give a close fit to a 
power function. 

Velocity of seen movement gave data 
that fit a power function so well that, 
as the authors say, “The goodness of 
fit may be judged as excellent.” The 
subjects alternately viewed two horizon- 
tally moving targets consisting of verti- 
cal white and black bands, and they ad- 
justed the velocity of one to make it 
appear half the other. The standard 


velocities ranged from 1.8 to 42 mm/sec. 


Ekman and Dahlback (9) call atten- 
tion to another interesting fact, namely, 
that when the subjects were asked to 
set one velocity equal to a standard 
velocity the variability of the settings 
was not very different from the varia- 
bilities obtained when one velocity was 
set to appear Aalf of a standard. They 
say, “the degree of uncertainty in frac- 
tionation is only a little greater than 
that in equating stimuli.” Hanes (28) 
had earlier noted the same thing in his 
experiments on brightness. He says, 
“Extremely interesting is the fact that 
the standard deviations around the group 
means of the equality matches . . . are 
of the same order of magnitude as the 
deviations around the means of the } and 
4 estimates at the same brightness lev- 
els.” I can add for loudness that the 
variability of halvings and doublings is 
actually slightly less, in general, than 
the variability of equality matches made 
between two noises of substantially dif- 
ferent spectra, say, a hiss and a rumble. 
In experiments involving several thou- 
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sand loudness matches (61) the stand- 
ard deviations ranged from 1.5 db, for 
very similar spectra, to 9.0 db for dis- 
parate spectra. Comparable variabili- 
ties for 36 subjects who halved and 
doubled the loudness of a white noise 
averaged 4.2 db (45). For pure tones, 
Garner (14) reports variabilities for 
halving that range from about 3 to 6 db. 

Visual flash rate gives the largest ex- 
ponent yet encountered. For rates be- 
tween 5 and 10 per second the ex- 
ponent is about 2.0. Actually the sub- 
jective scale for flash rate seems to 
break up into two segments: one for 
slow rates where the subject probably 
judges time elapsed between flashes, and 
one for faster rates (greater than about 
5 per second) where the subject seems 
to judge rapidity of flash. Except near 
the transition point, both segments of 
the scale approximate power functions 
with slopes decidedly greater than unity. 

This completes the list. Let us now 
look briefly at some of the antecedents 
of our power law. 


SomE ANTECEDENTS 


The idea that the relation between 
stimulus and perceptual response might 
be a power function is no new thing. 
What is new is the devising of experi- 
ments to demonstrate the fact. 

The first recorded notion that a power 
function might be involved appears to 
be the conclusion reached by Plateau 
(41) in the 1850’s—before the appear- 
ance of Fechner’s Elemente (1860). 
Plateau reasoned that, since the appar- 
ent relations among different shades of 
gray remain sensiblement le méme when 
the general illumination is changed, the 
ratios among the sensations produced. by 
the grays must remain fixed. This, he 
said, and quite rightly, is more rational 
than Fechner’s view (later revived by 
Wright [75]) that it is the differences 
that remain constant. Plateau’s as- 
sumption entails a power function, 
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Fechner’s a logarithmic function. By 
the present evidence, Plateau was right 
and Fechner was wrong. Sad to re- 
late, however, Plateau later changed his 
mind, and for a reason we now can see 
was irrelevant to the issue: Delboeuf’s 
equisection experiment did not yield a 
power function. And Fechner, crafty 
polemicist that he was, disposed of Pla- 
teau’s power function by the equally 
irrelevant argument that its champion 
had repented. 

But the power function came back 
to plague Fechner by another route. 
Brentano (4) suggested that an increase 
in a sensation is just noticeable when 
the increase is a fixed proportion of the 
original sensation. Brentano did not 
explore the mathematical consequences 
of this introduction of a kind of “Web- 
er’s law” into the realm of sensation, 
but Fechner did the mathematics for 
him (10) and showed that, if Weber’s 
law holds for both the sensations and 


the stimuli, the consequence is a power 


function. This way of getting to a 
power function was later argued for 
by others, particularly Grotenfelt (20), 
but since there is scarcely any way to 
prove the basic assumptions involved, 
the cause was lost under Fechner’s mas- 
sive attacks upon it. 

A third approach to the power func- 
tion was devised by Guilford (22, 23), 
who called it the “nth-power law.” It 
is based on the Fechnerian premise that 
j.n.d.’s are subjectively equal and that 
the scale of sensation can be had by 
summing them up. Guilford’s new 
twist was to alter Weber’s law by mak- 
ing the stimulus increment AS pro- 
portional not to S$ but to a power of 
S. A Fechnerian integration then yields 
a power function—which demonstrates 
that we can sometimes reach a correct 
conclusion by starting from two wrong 
assumptions. The assumption that AJ 
is a power function of J, with an ex- 
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ponent lying between 0.5 and 1.0, is 
not very easily justified by the facts. 
It is not so easily justified as the more 
plausible assumption that AS = k (S + 
d) where & and d are constants. This 
latter formula fits the data Guilford 
himself cited better than does the power 
function, and there is other evidence to 
support it (33). 

But it is Guilford’s other assumption 
that is the greater source of trouble. 
How serious is the consequence of as- 
suming the equality of j.n.d.’s becomes 
clear if we begin with any of the power 
functions (Table 1) whose exponent is 
greater than 1.0 and follow Guilford’s 
development in the reverse direction. 
We reach the startling conclusion “that 
AS would be a decreasing function of 
S, which is unheard of in psychophysi- 
cal research” (24). 

In the light of what we now know, it 
would appear that of the three histori- 
cal approaches to the power function 
Plateau’s was based on the most valid 
assumption. But unfortunately Pla- 
teau’s premise was only an assumption, 
and its empirical verification had to 
wait for nearly a hundred years. 

Our position now is more fortunate. 
The road to the formulation of a gen- 
eral psychophysical law need no longer 
be strewn with assumptions about j.n.d.’s 
or contrast steps. The principle I am 
proposing, that equal sensation ratios 
are produced by equal stimulus ratios, 
is merely the summary statement of 
what we observe on at least a dozen 
perceptual continua. The power func- 
tion which this simple principle entails 
appears to be the first-order relation 
between stimulus and response. Second- 
order departures from this law are cer- 
tain to exist, and their exploration may 
sometimes prove interesting and instruc- 
tive, but the wide invariance of the first- 
order relation deserves to remain the 
matter of first importance. 
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THE Sus fective S1zE OF THE J.ND. 


The issue raised by Fechner when he 
assumed that each j.n.d. represents a 
constant increment in sensation is a 
stubborn and vexatious problem. It is 
equivalent to the assumption that meas- 
ures of resolving power provide equal 
units that can be used as a measure of 
magnitude. Applied to Class I (pro- 
thetic) continua the assumption is dead 
wrong, but how can we prove it? Or 
rather, how can we make the proof per- 
suasive and convincing? Proof has been 
offered in the past (33, 38, 52) but 
ingenious counter-arguments have been 
marshaled against the evidence. Piéron 
(40) is the objector most persistent and 
erudite, but others join the chorus. It 
seems so simple and sensible to take the 
j.n.d. as a unit of sensation. How can 
it not be proper? 

It is improper simply because it is 
wrong. Let us consider a concrete case. 
Both fractionation and magnitude esti- 
mation show what is quite obvious to 
begin with, namely, that the apparent 
length of a line seen against a homo- 
geneous background is very nearly a 
linear function of physical length. Two 
inches look about twice as long as one 
inch. So we know that subjective length 
is related to physical length by a power 
function whose exponent is close to 
unity. Now what about the j.n.d., the 
resolving power? We can easily tell 
visually when one centimeter is added 
to one centimeter, but we can scarcely 
tell when a centimeter is added to a 
meter. So resolving power must be 
relative, or approximately so. As Miin- 
sterberg (37) showed, Weber’s law holds 
fairly well for length. Now we put 
these two facts together and what hap- 
pens? We see immediately that in 
terms of their psychological magnitude, 
as measured by the scale we first set up 
by fractionation and magnitude estima- 
tion, the j.n.d.’s get larger as we go up 
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the scale. Q.E.D., the j.n.d. is not con- 
stant in subjective size, and counting 
them off does not yield the magnitude 
scale. 

The principle involved can be clearly 
seen if we put the problem in more gen- 
eral terms. Let us assume that sensa- 
tion ¥ is a power function of stimulus 
magnitude S and also that Weber’s law 
is true, ie., AS = kS. By the first as- 
sumption we may write, disregarding 
constants that depend only on the choice 
of units, 

y= &. 


If Weber’s law is true, when we count 
off the j.n.d.’s and record their number 
J as a function of the stimulus, we ob- 
tain 

J = log.S, 


where c is a constant that can serve as 
the base of the logarithm. We can re- 
write this equation as 


S=¢ 


and substitute this expression for S in 
the first equation, which gives 


y=c*, 


This equation tells us that the psycho- 
logical magnitude grows as an exponen- 
tial function of the number of j.n.d.’s J, 
which brings us to the interesting con- 
clusion that the scale of sensation is 
an exponential function of the scale 
Fechner believed it to be. If we differ- 
entiate this equation we get 


dy 


— = acl", 


where a is a constant. The equation 
tells us that instead of being constant 
as Fechner assumed, the subjective size 
of the j.n.d. grows as an exponential 
function of the number of j.n.d.’s above 
threshold. Simple substitution in this 
last equation also reveals that the sub- 
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jective size of the j.n.d. is proportional 
to the psychological magnitude y. 
These equations tell us how it would 
be if Weber’s law were strictly true, but 
since this law is not strictly true the 
situation is not quite so simple. The 
complications, however, are only second 
order. A superior form of Weber’s law 
is AS = k(S + d), which was known to 
Fechner, Helmholtz and others. By the 
addition of the small constant d, this 
form makes allowance for the inevitable 
residual “noise level” in the perceptual 
system, but at the same time it pre- 
serves the basic principle that resolving 
power is relative. If this form is ac- 
cepted, the foregoing equations and deri- 
vations get complicated, but the con- 
clusions they lead to are not substan- 
tially altered. It is still true that the 


subjective size of the j.n.d. grows rap- 
idly as we go up the scale. 

Actually, since the value of the con- 
stant d is relatively small, its addition 
to the formula for Weber’s law affects 
the functions in a significant way only 


at low stimulus levels. At levels near 
threshold the general formulas devel- 
oped above break down, but over the, 
middle and higher stimulus ranges they’ 
give a good account of things. : 

A concrete instance of these general 
relations is shown graphically in Fig. 7.. 
Miller (33) measured the j.n.d. for white 
noise by the so-called quantal procedure 
(65) which provides an estimate of the 
quantal increment in the stimulus that is 
just detectable. The results are well de- 
scribed by the relation AJ = k(J + d). 
Miller summated the quanta (incre- 
ments) and obtained the top curve 
shown in Fig. 7. Except at the low 
end it is almost a straight line when 
plotted against decibels. The bottom 
curve is the sone scale of subjective 
loudness. The triangles represent the 
median results obtained by J. C. Stevens 
and E. Tulving (51) from 70 subjects 
who made direct magnitude estimations 
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Fic. 7. J.nd. (quantum) scale, category 
scale, and ratio scale (sones) for the loudness 
of white noise. Circles: with the spacing of 
the levels adjusted to give a “pure” category 
scale, 8 subjects each made two judgments of 
each level on a 7-point scale. The range was 
disclosed at the outset and the order of pres- 
entation was different for each subject. Tri- 
angles: median judgments by 70 subjects who 
made magnitude estimations of eight levels 
presented in irregular order in a classroom, 
with no designated standard. Estimates were 
transformed to a common modulus at the 75- 
db level, which was the first level presented. 


of the loudness of different levels of 
white noise presented in irregular order. 
In a log-log plot these points fall close 
to a straight line like that shown in 
Fig. 6. 

The curves in Fig. 7 show how vastly 
different are the two scales, the j.n.d. 
(or quantum) scale and the loudness 
scale. In this respect they confirm what 
we have already seen in Fig. 1. They 
also show that when we use the loud- 
ness scale to measure the subjective size 
of successive j.n.d.’s, their size grows 
rapidly as we go up the scale. Except 
near the low end, their growth approxi- 
mates an exponential function. Actu- 
ally, over the low end of the scale where 
the j.n.d. scale is curved there is a su- 
perficial similarity between the j.n.d. 
scale and the loudness scale, but over 
the major portion of the audible range 
the divergence between the two scales 
is large and obvious. 

The middle curve in Fig. 7 shows an 
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example of the function obtained when 
the loudness of a white noise is judged 
on a scale from 1 to 7 (data from Stev- 
ens and Galanter [64], Fig. 9A). The 
spacing of the stimuli had been ad- 
justed to give a “pure” category scale. 
Just as in Fig. 1, we see that the form 
of the category scale is intermediate be- 
tween that of the j.n.d. scale and the 
scale of subjective magnitude. If plotted 
against the sone scale the category scale 
would be concave downward, as are all 
category scales on prothetic continua. 
It has occasionally been asserted that 
some scales of sensory magnitude agree 
with the scale obtained by counting off 
j.n.d.’s (74). Hanes’ (27) statement 
that this is true for brightness is a recent 
example, which has led, incidentally, 
to a certain amount of confusion. In a 


subsequent study Hanes (28) extended 
the range of his measurements far 
enough for us to see clearly that the 
scale of subjective brightness (in brils) 
departs from the j.n.d. scale in the 


expected way. Perhaps Hanes should 
have stated this explicitly. 

I have summed the j.n.d.’s for bright- 
ness obtained by three different experi- 
menters (2, 18, 35) and have compared 
the results with Hanes’ bril scale. The 
pictures presented are essentially simi- 
lar to that shown for loudness in Fig. 7, 
except that the curved tail at the lower 
end of the j.n.d. scale tends to be longer. 
The basic fact is that over the middle 
and upper ranges of intensity Weber’s 
law holds fairly well for brightness, but 
the bril scale is a power function. Over 
the range of the fainter luminances, 
Weber’s law breaks down to a degree 
sufficient to produce the apparent agree- 
ment Hanes spoke of. But it is quite 
wrong to jump to the general conclusion 
that j.n.d.’s add up to the bril scale. 

We have already noted, in the dis- 
cussion of Guilford’s “nth-power law,” 
that in order for the j.n.d.’s to add up 
to give the magnitude function, AS 
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cannot in general be an increasing func- 
tion of S. For most perceptual continua 
it must be a decreasing function of S. 
But this is “unheard of in psychophysi- 
cal research.” It would mean, for ex- 
ample, that it would be easier to tell 
when a centimeter was added to a meter 
than when a centimeter was added to 
a centimeter. Resolving power simply 
does not work that way. 

Suppose now for a moment we risk 
Titchener’s displeasure and try to think 
like a physicist. Physics also faces the 
problem of resolving power and in most 
physical measurements resolving power 
is relative—the accuracy of measure- 
ment is proportional to magnitude. This 
fact is expressed by saying that length 
can be measured to 1 part in 10°, that 
resistance can be measured to 1 part in 
10°, or that time intervals can be meas- 
ured to 1 part in 10%. In this sense 
Weber’s law has its analogue in phys- 
ics. Now, it is conceivable that some- 
one might propose that we give up our 
ordinary ways of measuring length, say, 
and devise instead a scale whose unit is 
the resolving power of our best pro- 
cedures. We would then be doing what 
Fechner proposed. In terms of our pres- 
ent units, the new units of length would 
grow increasingly larger as we go up 
the scale, and they would do so for the 
same reason that the j.n.d.’s grow larger 
as we go up the psychological scale on 
a prothetic continuum. The laws of 
physics could be rewritten in terms of 
the new scales of units based on resolv- 
ing power, but it would probably strike 
the physicist as less than sensible to try 
to do it—and not simply because it 
would be hard work. For reasons that 
he could readily justify, the physicist 
wants to start with magnitudes and not 
with resolving power. What I am sug- 
gesting is that there is sufficient parallel 
between physics and psychophysics to 
justify the same approach in both fields. 
In neither discipline should we try to 
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make resolving powers, variabilities, con- 
fusions, j.n.d.’s, discriminal dispersions 
or any such entities serve as units of 
magnitude on continua that behave like 
the prothetic continua of Class I. 


ScALING THEORY 


The same counsel recommends itself 
to those of us who devise scaling pro- 
cedures for other than perceptual vari- 
ables. As Gulliksen (26) complains, 
those who worry about psychophysical 
measurement often ignore the work on 
nonsensory scaling, and vice versa. 
Titchener (71) also once complained 
that psychologists have ignored the ad- 
vances in subjective scaling achieved by 
the meteorologists. Insularity is prob- 
ably inevitable in an age of specializa- 
tion, but it has its dangers. One of the 
dangers is that the laws of human judg- 
ment developed under simple conditions, 
where we can better see what is going 
on, may be ignored when we work in 
more complicated realms where discern- 
ment is more difficult. 

A large segment of psychological scal- 
ing theory has been built around models 
that make scale “units” out of some 
measure of “discriminal dispersion,” but 
this is precisely the kind of unit that is 
demonstrably not invariant on prothetic 
continua. Variabilities and confusions 
provide a tempting starting point for 
psychological measurement, chiefly be- 
cause dispersion among our judgments 
is something we always have with us, 
but the transforming of unreliability, 
inconstancy, or confusion into units of 
measure somehow needs more than as- 
sumptions to back it up. Our funda- 
mental scales can, I think, stand on 
firmer ground than the mere scatter of 
our data. 

It is recognized, of course, that meas- 
ures of confusion often have a useful- 
ness of their own, and that for certain 
purposes it is desirable to construct 
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“equal discriminability” scales (16). 
The engineering of schemes for optimiz- 
ing the coding of information on cath- 
ode-ray tubes presents an interesting 
example of how such measures may be 
used in the prediction and control of 
confusability (36). But our concern 
here is with a different problem, namely, 
whether measures of confusion can be 
used as units of magnitude. 

The use of dispersion as a tool for 
magnitude scaling requires assumptions 
regarding its behavior. How does dis- 
persion (in psychological units) vary 
over the psychological continuum in 
question? A variety of assumptions re- 
garding the behavior of variability have 
been elaborated, but so far as I know, 
none of them come close to admitting 
the drastic growth in dispersion that we 
find to be true for Class I (prothetic) 
continua. Here, we recall, the disper- 
sion (in subjective units) approaches 
an exponential function of the number 
of j.n.d.’s above threshold or, alterna- 
tively stated, it grows almost in direct 
proportion to the psychological magni- 
tude. The assumption made in prac- 
tice is usually that dispersion (in psy- 
chological units) is constant, or nearly 
so. This is like Fechner’s assumption 
that j.n.d.’s are equal in subjective size. 
Direct experiments can sometimes verify 
this assumption on continua of Class II 
(metathetic), but on the “quantitative” 
continua of Class I they show that the 
assumption fails by a wide margin. 

On continua that behave like Class I 
we would be closer to right if we began 
with the assumption that discriminal 
dispersion is not constant but is pro- 
portional to the psychological magni- 
tude in question. When the psycho- 
logical magnitude is a power function 
of the stimulus, this assumption is 
equivalent to saying that psychologi- 
cal values separated by equal units of 
dispersion on the stimulus scale stand 
in a constant ratio to each other. If 
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we were to start from this assumption, 
could we then proceed to construct a 
useful scale of the psychological mag- 
nitude from measurements made of dis- 
criminal dispersion? In principle we 
might erect a new type of scale, but we 
could not construct an interval or a 
ratio scale of the kinds we ordinarily 
employ. While working on the theory 
of scales (54) back in the late 1930’s I 
had a hard time convincing myself that 
if we know concerning a series of values 
only that a/b = b/c=c/d=... we 
are powerless to proceed further toward 
what I have called a ratio scale of meas- 
urement. But the fact is that the equat- 
ing of ratios gets us no further unless 
(a) we can also equate intervals, or (b) 
we can determine the numerical value 
of the ratio (as we do under the ratio 
methods reviewed above). An example 
of how we might use equated ratios plus 
equated intervals to construct a ratio 
scale is discussed elsewhere (55, p. 24). 

But let us pursue for a moment the 
problem of what we might do when we 
have equated a set of ratios: a/b = b/c 
=c/d.... We can assume that we 
have an operation for ordering these 
values and that a<b<c<d.... 
The problem then is, how may we as- 
sign numerical values to this series? 
As is true when we have a series of 
equated intervals, a —-b6=b—c=c— 
d=... , we can pick any two values 
arbitrarily (subject only to the ordinal 
requirement), and having done so, all the 
other values are determined. Thus for 
the ratio series, if a = 2 and 6 = 6 then 
c = 18, d= 54, etc. The next question 
is, what transformations are permitted 
on this scale? We can show that any 
value x may be replaced by x’ provided 
x’ = kx" where & and m are constants 
that may take on any positive values. 
In other words we can always transform 
the scale values by a power function. 
For example, if we square all the values, 
the ratios remain equal. 
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A scale of this kind may be mathe- 
matically interesting, but, like many 
mathematical models, it has thus far 
proved empirically useless. At least, so 
far as I am aware, no uses for it have 
yet been found. In a formal sense there 
are certain parallels between a scale of 
equal but indeterminate ratios and a 
scale of equal but indeterminate inter- 
vals. On both, for example, any two 
values can be selected arbitrarily, and 
the scale values of both can be trans- 
formed by an equation involving two 
arbitrary constants. But the scale of 
equal intervals has great utility, as wit- 
ness the Fahrenheit and Celsius scales 
of temperature, and the scale of calen- 
dar time. And differences on interval 
scales can be measured on ratio scales, 
as when we measure intervals of dura- 
tion in seconds. But what can we do 
with equated ratios when the value of 
the ratio is indeterminate? 

An obvious suggestion is that we 
might convert to logarithms and express 
the equated ratios as log a — log b= 
log 6 — log c = log c — log d, etc. If 
we then restrict the values of a, 8, c, 

. to positive numbers, we can set up 
an interval scale in logarithms. There 
is no a priori reason why we could 
not put this scale to work in a manner 
analogous to the workings of our linear 
interval scales. The linear and the loga- 
rithmic scales have in common the fact 
that the choice of the zero point is ar- 
bitrary. It might be interesting to ex- 
plore the consequences of this procedure 
and perhaps try to answer the ques- 
tion why science has never made use of 
these logarithmic interval scales based 
on equated ratios. 

Another question arises. If we were 
to develop such scales, what would we 
call them? When I originally proposed 
(54) the names nominal, ordinal, inter- 
val, and ratio for the four classes of 
scales commonly used, I worried some 
about the scale based on equated ratios, 





ON THE PsycHopHysIcAL Law 


but I dismissed it as a purely academic 
question, for there appeared to be no 
examples of its use. If uses for it were 
to develop, we would like, perhaps, to 
call this class of scales ratio scales, for 
they are based on an empirical operation 
for equating ratios. But since the term 
ratio scale has now gained wide cur- 
rency as the name for the scales that 
admit only a similarity transformation 
(multiplication by a constant) it may 
be better to call the scale based on 
equated ratios a logarithmic interval 
scale. 

With this class of scales included, the 
hierarchy of scales, together with the 
group of transformations permitted by 
each scale, would take the form shown 
in the diagram. 


Nominal Scale 
x' = f(x) 
(permutation group) 


Ordinal Scale 
x = f(x) 


ware wt 


Linear Interval Scale Logarithmic Interval Scale 


x'=ax+b us kx” 


(linear or affine group) (power group) 


\ Ratio Scale ff 


x" = cx 


(similarity group) 


As this diagram suggests, a ratio scale 
is possible whenever empirical opera- 
tions are available to create both types 
of interval scales. If we can determine 
both equal differences and equal ratios 
we can eliminate the additive constant 
b and the exponent , and we are left 
with only a multiplying constant c. 
This defines a scale with a zero point— 
what we call a ratio scale, or, if we 
prefer, a “zero-point” scale. 

Returning now to the assumptions 
made about discriminal dispersion we 
note that, in terms of the scales listed 
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in the diagram, the assumption most 
often made is that the processing of 
variability will yield a linear rather than 
a logarithmic interval scale. It is not 
only in connection with the method 
of pair comparisons, which Thurstone 
called the “best of all the psychophysi- 
cal methods” (70), that the wrong as- 
sumption of constant dispersion (in psy- 
chological units) often gets made. This 
assumption is either latent or explicit 
in many subtler procedures, such as 
the methods of “successive intervals,” 
“graded dichotomies,” “successive cate- 
gories,” etc. (see 23). The foregoing 
arguments suggest that whenever these 
methods are applied to prothetic con- 
tinua, continua on which the j.n.d. (dis- 
persion) tends to vary directly with 
magnitude, the resulting scales will turn 
out to be nonlinear. It seems obvious 
that none of the scales listed in Table 1 
could be constructed by the method of 
pair comparisons, or by any of its sev- 
eral cousins. The laws of judgment de- 
rived from direct experiments on per- 
ceptual continua make it plain that, un- 
less it can be demonstrated that the 
judgmental continuum behaves like those 
of Class II (metathetic), any scaling 
procedure that is geared to the assump- 
tion that equally often noticed differ- 
ences are equal is defective. 
Fortunately, this does not mean that 
no scaling is possible. We still have 
other devices, such as the direct meth- 
ods for ratio scaling. Is there a genuine, 
nontrivial, substantive problem calling 
for a psychological scale for which these 
methods, or some variation on them, 
cannot produce an acceptable answer? 
Or is it merely that we like to assume, 
with Fechner, that indirection is the best 
path to the goal? Contrary to a com- 
mon assumption, the use of the direct 
methods does not require knowledge of 
an underlying, measurable physical con- 
tinuum (48). Only a nominal scale 
(55) is required at the stimulus level, 
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ie., the stimuli must be identifiable by 
the experimenter. For example, when 
the observer judges the apparent ratio 
between two stimuli the experimenter 
needs to know which two stimuli were 
involved, but he does not need to know 
their values on any other scale. I know 
from experience that it is more com- 
fortable to take it for granted that a 
direct method is impossible than it is 
to try to work one out. And it is still 
more onerous to test and refine the di- 
rect methods and to purge them of con- 
straints and biases. The problem of the 
scaling of psychological continua is full 
of booby traps, but so is the laboratory 
measurement of a physical continuum 
like, for instance, electrical inductance 
or mechanical impedance. 

Perhaps one of our professional idio- 
syncracies is that in psychology we are 
sometimes more enamored of models and 
methods than we are of problems. One 
occasionally gets the impression that 
there are more people with a method 
who are looking for a problem to use it 
on than there are searchers with a prob- 
lem looking for a method. It is ad- 
mittedly more entertaining to make a 
formal model, complete with assump- 
tions, postulates, and derivations, than 
it is to grub through the empirical 
tangle of an experimental issue. But if 
something empirically useful is to issue 
from a model, something empirically 
known must be put into it. The gra- 
tuitous assumption that some measure 
of dispersion, sensitivity, or resolving 
power can be used as the unit of a scale 
of psychological magnitude does not 
meet this requirement. 


SUMMARY 


This paper has rambled far enough 
afield at this point and it is time to try 
to restate the issues in briefer compass. 
The main points are these. 

Two general classes of perceptual con- 
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tinua can be distinguished by means of 
four functional criteria. On Class I or 
“quantitative” continua the j.n.d. in- 
creases in subjective size as psychologi- 
cal magnitude increases, category rat- 
ing-scales are concave downward when 
plotted against psychological magnitude, 
comparative judgments exhibit a time- 
order error (a “category effect”), and 
equisection experiments exhibit hystere- 
sis. On Class II or “qualitative” con- 
tinua these four effects are apparently 
absent. Class I, called prothetic, in- 
cludes those continua on which dis- 
crimination is mediated by an additive 
mechanism at the physiological level; 
Class II, called metathetic, includes 
those mediated by a substitutive mecha- 
nism. 

On Class I (prothetic) continua the 
use of one or more of four kinds of 
direct methods for constructing ratio 
scales reveals that equal stimulus ratios 
tend to produce equal subjective ratios. 
Hence, to a first-order approximation 
the “psychophysical law” relating stimu- 
lus and response is a power function. 
The exponent, as measured on fourteen 
different continua, varies from about 0.3 
for loudness to about 2.0 for visual flash 
rate. A few workers in the past have 
conjectured this power function, even 
starting sometimes from wrong assump- 
tions. Only lately, however, is this law 
becoming securely anchored in experi- 
ment. 

Fechner’s logarithmic law is not found 
in experiment for the simple reason that 
resolving power (the j.n.d.) is not con- 
stant in psychological units, but is 
roughly proportional to psychological 
magnitude. For this reason, all pro- 
cedures of Fechnerian extraction, like 
the method of pair comparisons and its 
related techniques, which seek to build 
scales out of “unitized” measures of 
dispersion, are not proper methods for 
scaling magnitudes that behave like 
Class I or prothetic continua. When- 
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ever psychological scales are called for, 
direct ratio scaling methods should prob- 
ably be tried. 
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This paper presents a quantitative 
description of some simple selective- 
learning data. The descriptive sys- 
tem is shown to subsume the writer’s 
theory of discrimination learning (11), 
and to relate closely to Estes’ statisti- 
cal theory (5). 

Recent mathematical formulations 
of learning (2, 3, 4, 5, 6, 11) describe 
the stimulus situation as a set of ele- 
ments, each of which is conditioned to 
(tends to evoke) exactly one response 
at a given time. During learning, if 
a certain response A, is reinforced, a 
cue may switch and become newly 
conditioned to A;. The probability 
of such a change is the rate-of-learning 
parameter. Estes (4, 5, 7) and Bush 
and Mosteller (2, 3) have shown that 
such a model has many consequences 
for learning with random reinforce- 
ment schedules, correctly predicting 
asymptotes in guessing experiments 
(3, 5, 7). However, certain discrep- 
ancies have appeared when theoretical 
expectations are compared with per- 
formance on early trials. Estes* has 
listed some problems which, if solved, 
would contribute to a workable de- 
scriptive theory of elementary learn- 
ing. One of these is the tendency of 
the parameter representing learning 
rate to vary systematically with rein- 

1 Appreciation is due Dr. W. K. Estes for 
his advice and criticism, and for lending pre- 
publication drafts of papers. The writer has 
benefited from extended seminars and con- 
versations with L. B. Wyckoff, P. Suppes, and 
D. H. Howes. 

2 Now at Michigan State University. 

3 See Estes, W. K., The statistical approach 
to learning theory (draft), prepared for the 
report of APA “Project A,” Analysis of 
psychological science. 
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forcement probabilities. In general, 
learning is more rapid when the prob- 
abilities of reinforcement are more 
extreme. A second problem is that 
subjects can master discrimination 
problems to perfection even in the 
presence of “‘overlap’’ stimuli which 
are not differentially reinforced. The 
writer’s discrimination theory (11) 
has some success in resolving the 
stimulus-overlap problem by assum- 
ing that irrelevant cues are adapted. 
This paper extends this discrimination 
theory to cases of random reinforce- 
ment, using Estes’ equations for prob- 
ability learning, but, in addition, 
offering a descriptive account of how 
learning rate may depend on reinforce- 
ment probability. 

The major points to be made in this 
paper are that (1) the stimulus situ- 
ation in two-choice learning experi- 
ments is represented by a set of dis- 
criminable aspects called ‘‘cues’”’; (2) 
a cue may be “‘conditioned”’ to either 
response ; (3) a cue may be “‘adapted”’ 
and rendered nonfunctional during 
learning; (4) the probability of a re- 
sponse is the proportion of the un- 
adapted cues conditioned to it; (5) 
the degree to which a cue is differenti- 
ally reinforced determines its ‘“‘val- 
idity’’; and that (6) the rates of con- 
ditioning and adaptation depend on 
the validities of the cues available. 

When these assumptions are stated 
quantitatively, the theory is shown to 
have three characteristics: first, under 
certain limiting experimental condi- 
tions it leads to a process of probabil- 
ity learning similar to that in Estes’ 
theory, and in particular with the same 
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asymptotes; second, when applied to 
discrimination learning it has the same 
consequences as the writer's earlier 
discrimination-learning theory (11), 
with the same way of handling stimu- 
lus overlap; and third, it describes the 
connection betweeen reinforcement 
probabilities and learning rates in the 
simplest probability-learning experi- 
ments. 


THEORY 
The Set of Cues 


We think of the stimulus situation 
as composed of a set of discriminable 
aspects, called ‘‘cues,”’ k, k’, k”. The 
set of cues is called K and the number 
of cues is N. The proportion of cues 
belonging to any manipulated aspect 
of the situation is measured from be- 
havior, and the number of cues, N, is 
for most purposes arbitrary. 


Conditioning of Cues 


Following Estes (4), we assume that 
a cue is conditioned to one or the 
other response alternative at any 
time, on an all-or-none basis. The 
probability that cue k is conditioned to 
response A, at trial m is called F(k,n). 
If a cue is conditioned to A» and then 
A, is reinforced, it may switch over 
and become conditioned to A;. The 
probability of such a switch is a con- 
stant called 0, the “rate-of-learning”’ 
parameter. On this assumption, we 
get the following equations of change 
of F(k,n): 


i) If A, is reinforced on trial n, 
F(k,n+1)=F(k,n)(1-—0)+0. (1) 
ii) If A» is reinforced on trial n, 
F(k,n + 1) = F(R,n)(1 — 6). (2) 


If A, is reinforced on some fixed 
proportion m of the trials and Ag is 
reinforced on all other trials, the mean 
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equation is 
F(k,n+1)=F(k,n)(1—0) +70 (3) 
which has the solution 


F(k,no +n) = 2 
— [w — F(k,mo) ](1 — 0)". (A) 


Equation 4 is the conditioning equa- 
tion which will be used in the applica- 
tions discussed in this paper. 


Adaptation of Cues 


During learning, a cue may be- 
come ‘‘adapted’’ (11) and lose its 
effect on response. Different cues 
have different pi abilities of be- 
coming adapted. ii cue & is not 
adapted by the beginning of trial n, 
the probability that it will be adapted 
by trial 2 +1 is $(k). This con- 
sideration gives us an equation for 
a(k,n), the probability that cue e is 
adapted at the beginning of trial n, as 
follows: 


a(k,n + 1) = a(k,n) 
xX [1 — (2) ] + o(k), (5) 


which has the solution 


a(k,no + m) = 1 — [1 — a(k,m) | 
x [1 — o(k)]""*. (6) 


Equation 6 is the adaptation equation 
which will be used in applications. 


Probability of a Response 


The probability of response A; is 
the proportion of unadapted cues 
conditioned to it. The probability 
that cue k is unadapted is 1 — a(k,n), 
and the probability that it is condi- 
tioned to A; is F(k,n). Thus, 


_ 2F(b.n)[1 — o(k,n)] 
ep) = “sit aka)” 


(ka member of K). (7) 
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Validity of Cues 


The experimenter determines the 
validity of a cue by differential rein- 
forcement. Suppose that one re- 
sponse to cue k is reinforced proportion 
x of the time and the other is reinforced 
l-r of the time. The cue is a good 
predictor of reward if w is near one or 
zero, and it is useless if = .5. The 
term ‘“‘validity’’ underscores an anal- 
ogy between the problem facing an S 
in selective learning and that facing a 
diagnostician. As the diagnostician 
tries to predict some criterion, using 
various test items with varying validi- 
ties, so the S tries to select the correct 
response, using various cues which 
may be related in varying degrees to 
reward. 

Validity of a cue, V, may be meas- 
ured in several ways, but to get the 
simplest empirical laws it seems best to 
define it as 


V=4r-47+1. (8) 
The use of a quadratic function is sug- 
gested by the possibility that validity 
depends on the proportion of pairs of 
trials on which consistent information 
is received, 7 + (1 — w)*, and on 
which contradictory information is re- 
ceived, 2 r(1 — x). The coefficients 
of the quadratic equation are fixed by 
making the validity of a consistently 
reinforced cue (+ = 1 or = 0) equal 
to one, and the validity of an irrele- 
vant cue (x = .5) equal to zero. 


Rates of Conditioning and Adaptation 
as Functions of Cue-Validity 


In the writer’s discrimination learn- 
ing theory (11) it was assumed that 
the rates of conditioning of relevant 
cues, 6, and adapting irrelevant cues, 
¢(k’), were equal to one another and 
were also equal to the proportion of 
relevant cues. ‘This basic idea is now 
generalized to experiments with vari- 
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ous reinforcement probabilities and 
to cues with validities between zero 
and one. 

It is assumed that the rate-of-learn- 
ing parameter controlling condition- 
ing of cues, 0, is equal to the mean 
validity of cues. That is, 


> V;/N, 
(k a member of K). 


= 
(9) 


If all cues have validities 1 or 0, as is 
the case in the discrimination theory, 
then the mean validity is equal to the 
proportion of relevant cues. 

It is assumed that the probability 
of adapting cue k’ depends on the 
number of cues more valid than k’, 
and the degree to which they are more 
valid. Mathematically 


o(k’) = = (Vi — Ver)/N, (hk, R’ 
in K,and V,> V;,-). (10) 


If Vy = 0, then ¢(k’) = @ and the 


probability of adapting an invalid cue 
is the same as the probability of con- 


ditioning. If there is no cue with 
higher validity than k’, then $(k’) 
= 0 and k’ is not adapted. 


RELATION TO STATISTICAL 
LEARNING THEORY 


It was stated that the present 
theory is, under certain circumstances, 
very like Estes’ statistical learning 
theory. If all relevant cues have the 
same probability, F(k,n), of being 
conditioned to the A, response, and if 
all irrelevant cues are adapted, then 
from Equation 9 


p(n) = F(kR,n). (11) 


If we substitute p(m) for F(k,m) in the 
difference Equations 2 and 3, we have 
the simplest cases of statistical learn- 
ing theory. From Equation 7 we see 
that as increases, a(k’,n) asymptoti- 
cally approaches 1, so that after suffici- 
ent training all irrelevant cues will be 
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adapted. Thus as training progresses 
the present theory gets closer and 
closer to the Estes model, and the 
asymptotes of the Estes model are in 
at least some cases the asymptotes of 
the present theory. In analysis of 
the present cue-learning theory it is 
often convenient, if only asymptotic 
probabilities are of interest, to study 
Estes’ simpler model.‘ 

This correspondence between the 
present theory and statistical learning 
theory is neither an accident nor a 
convergence of independent theoreti- 
cal approaches. The elegance and 
empirical success of the Estes and 
Bush-Mosteller formulations led the 
present writer to adopt their funda- 
mental notions and formulas, and the 
correspondence to earlier models was 
built into this theory. 


RELATION TO DISCRIMINATION 
LEARNING THEORY 


In an earlier paper (11) the writer 
offered a theory of two-choice dis- 
crimination with consistent reinforce- 
ment of relevant cues. A discrimina- 
tion learning problem was described 
as having r relevant cues (with valid- 
ity V, = 1) and 7 irrelevant cues with 
validity V» = 0. At Trial 1 neither 
relevant nor irrelevant cues are differ- 
entially conditioned—a situation we 
may represent in the present notation 
as 
(12) 


F(k,1) = F(R’) = .5. 


p(no+n) 
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Similarly, no cues are adapted at 
Trial 1, so that 
a(k,1) = a(k’,1) =-0. (13) 
From Equation 9, 
6 = r/(r + 1) 
and from Equation 10, 
o(k) = 0 
o(k’) = 6. 


These bounding conditions inserted in 
Equations 4, 6, and 7 yield the equa- 
tions of discrimination learning theory. 


(14) 


(15) 
(16) 


VALIDITY AND THE RATE OF LEARNING 
IN GUESSING EXPERIMENTS 


Several experiments on guessing as 
a function of the probability of rein- 
forcement have found that the rate- 
of-learning parameters depend on 7. 
Our notion of cue-validity, and the 
assumption that learning rates de- 
pend on validity (Equations 9 and 10), 
are intended to give a quantitative 
description of such findings. 

In each such experiment we assume 
that there are r relevant cues with re- 
inforcement probability 2, and 7 ir- 
relevant cues with reinforcement prob- 
ability .5. Let k bea typical relevant 
cue, and k’ a typical irrelevant cue. 
Substituting Equations 4 and 6 into 
Equation 7 and noting that ¢(k’) = @, 
we have 





_ rlx—[e— F(k,no) ](1—8)"} +iF (k’,no+n)[1 —a(k’,mo) }(1—0)" 
¥ r+i[1—a(k’,no) ](1—0)"" ’ 


4An important advantage of Estes’ and 
Bush and Mosteller’s theories is their mathe- 
matical development, which gives detailed 
statistics of data in a wide variety of random 
reinforcement situations. The equations for 
p(n) of the present theory are considerably 
more difficult to work with in most cases. 


(17) 


If we let 6 = r/(r + 7) be the propor- 
tion of relevant cues, it follows from 
Equation 11 that 


6=6V (18) 
where V is defined by Equation 8. 
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If we divide all the terms in the 
right member of equation 17 by r + 7 
and use the definition of 6 to simplify, 
and at the same time substitute 6V 
for 8, it follows that 
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ing conditions. While the experi- 
menter may intend to reinforce alter- 
native A, on some fixed proportion + 
of the trials, he can actually accom- 
plish this only on the average. On 


5{w—[a— F(R,no) ](1—6V)"} 


+ (1—6) F(R’,not+n)[1 —a(k’ no) ](1—-8V)"— 


Inspection of Equation 19 shows that 
if (1 — 6V)""' approaches zero the 
limit of p(mo +m) is wr. In such a 
situation the subject approaches the 
“probability matching’ asymptote, 
eventually making response A; as 
frequently as it is reinforced. The 
larger V is, the more rapidly will this 
asymptote be approached, for the 
larger V the smaller (1 — 6V)"". 
Thus Equation 19 relates the speed of 
learning to validity, and through valid- 
ity to probability of reinforcement, in 
an appropriate fashion. 

The aim of this paper is to make 
accurate quantitative statements 
about the relationship of learning rate 
to V. To do this we must establish 
the starting conditions by determin- 
ing for some trial, mo, the values of 
F(k,no), F(R',mo), and a(k’,no). 


Starting Conditions in Guessing Ex- 
periments 


In the experiments that we wish to 
consider, subjects commonly show a 
noticeable bias, so that p(1) ¥.5. 
It is pcssible to estimate p(1) from the 
proportion of A; responses made on 
early trials. We do not know which 
cues are responsible for any bias that 
appears, so for convenience we assume 
that the initial bias is distributed 
equally over all cues. That is, we 
assume as a starting condition that 


F(k,1) = F(k’,1) = p(i). 


Besides initial biases, one further 
consideration arises in regard to start- 


(20) 


5+ (1—6)[1—a(k’,mo) ](1—5V) 


(19) 


the first trial he must reinforce one 
response or the other, and the actual 
relative frequency of reinforcements 
of A, becomes either 1 or 0. It may 
happen that the subject gets only rein- 
forcements of A; and no reinforce- 
ments of A» for the first four or six or 
even ten trials, though some Az rein- 
forcements are to be given. During 
this initial block of trials the problem 
given the subject is just the same as 
if the experimenter intended to give 
reinforcement consistently with 7 = 1. 
Thus, for some initial block of mo 
trials, the subject works on a problem 
with actual = 1 and V = 1, and 
then shifts over to a problem with 
intermediate reinforcement probabili- 
ties. 

An exact learning curve p(m) can 
be written using the actual sequences 
of reinforcements. Computations are 
easier if we assume that, for the first 
no = r/(1 — x) trials, only A, rein- 
forcements are given (mr > .5), on 
Trial m) +1 an Az» reinforcement is 
given, and on all subsequent trials 7 
is a satisfactory approximation to the 
actual relative frequency of A, rein- 
forcements. 


Probability of an A, Response on 
Trial n in a Guessing Problem 


During the first mo trials the subject 
has a problem just like simple dis- 
crimination learning with starting 
probability (1). Thus for any n’ 
less than or equal to mp + 1, 
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p(n’) = 


whereas for any later trials m»+ m we 
employ Equation 19 with 


F(k,mo + 2) = {1-[1 — p(1)] 
X (1 — 6)¥°} (1 — 8) 
F(k’,no + 2) = 5 — (.5 — p(l)) 
x (1 — 6)™t 


a(k’,no + 2) = 1 — (1 — 8)reH, 


In this fashion we construct a theoret- 
ical learning curve for learning in a 
guessing experiment. 

In order to estimate parameters 
and test the theory conveniently, we 
wish to avoid the laborious process of 
fitting Equations 19 and 21. To do 
this we use the theoretical ‘‘total 
score” (as in 11, Equation 8), ob- 
tained by cumulating the learning 
function p(m). The method, accurate 
to a good approximation, is to replace 
the discrete function p(m) by a func- 
tion of a continuous variable t, and 
integrate the resulting p(t). The 
integration is elementary® and gives 
an explicit function which, while 
bulky, is usable. We shall call this 
integral (which is not given here) 
7, representing the cumulative p(n) 
from 1 tom. 7, is the total number of 
A, responses made on the first m trials 
of training. 


Empirical Estimates of V as a Function 

of 

If two or more groups of subjects 
from the same population are trained 
with the same apparatus and instruc- 
tions but with different reinforcement 
probabilities, we assume they have the 
same cues available, and thus the 
same proportion of relevant cues 4, 
though the validity of the relevant 
cues will differ. In such an experi- 


5 See Burington’s Tables (1, p. 82), formulas 
309 and 310. 
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6{1 — [1 — p(1)](1 — 4)"} + (1 — 4)p(t)(1 — 8)" 
6+ (1 — 6)” 


a ae 


ment one group of subjects can be 
used to estimate 6, and this same 6 
can be used for the other group, per- 
mitting a direct numerical estimate of 
V as a function of x. Grant, Hake, 
and Hornseth (9) ran five groups of 
twenty subjects each in a guessing 
experiment with reinforcement prob- 
abilities of 1.00, 0.75, 0.50, 0.25, and 
0.00. Symmetry in the data permits 
us to lump the1.00and the0.00 groups, 
calling them a = 1, and the .75 and 
.25 groups, calling them z = .75. 
The .50 group showed virtually no 
learning, as expected, and is disre- 
garded. 

The data from this experiment are 
used to check the assumed validity 
function given in Equation 8. The 
relevant cues for the + = 1 group 
were assigned validity V = 1. From 
this group we estimated the propor- 
tion of relevant cues, 6. Using this 
value 6, we then found the value of V 
which would cause the .75 group to 
make 34.5 A, reponses in 60 trials, 
as observed. The results of the 
computations are shown in Table 1. 
The obtained value of V at .281 is 
relatively close to the value predicted 
by Equation 8, which is 4(.75*) — 
4(.75) +1 = .250. There seems to 
be no way to test the statistical reli- 
ability of the discrepancy. 

A second measure of V is accom- 
plished by using the data from an ex- 
periment by Goodnow and Postman 
(8). They used reinforcement prob- 
abilities of 1.0, .9, .8, .7, .6, and .5 in 
a “problem-solving” situation. A pe- 
culiarity of their technique was a pre- 
training session in which the subjects 
were instructed and trained to attend 
to the relevant cues. We shall as- 
sume that this pretraining had the 
same effect as adapting all irrelevant 
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TABLE 1 


SUMMARY OF CALCULATIONS ON GRANT-HAKE-HORNSETH EXPERIMENT 








no =x/(1—7) 


| pit) 


Estimated V Predicted V 
| 





1.00 | 46 


75 46 


| 
| 
| 
| 
| 


—(Assumed to be 1.0)— 





| 250 





cues before the actual training trials 
began. The assumption is stated as 


a(k’,1) = 1 


for any irrelevant cue k’. With this 
assumption we see that, according to 
Equation 7, for all trials n, 


p(n) = F(R,n). 


With this special assumption it is 
possible to sum p(m) directly, dis- 
pensing with the approximation. 

In analysis of the Goodnow-Post- 
man experiment we again use the 
x = 1 group to estimate 4, the pro- 
portion of relevant cues. For all 
other groups all parameters except 
p(1) were known, and estimates of 
p(1) were made easily, since all groups 
showed about the same bias. Under 
these circumstances it is possible to 
make empirical estimates of V as a 
function of reinforcement probability. 
This was done using the .9, .8, .7, and 


.6 groups. The detailed results of the 
computations are shown in Table 2. 
The correspondence between esti- 
mated and predicted values of V is 
quite good, especially for those groups 
which learned slowly and had small 
values of V. Since there were only 
eight subjects per group the estimates 
of V are probably not highly accurate. 
However, when the estimates of V 
from the Grant-Hake-Hornseth and 
Goodnow-Postman experiments are 
plotted against the prediction from 
Equation 8, as in Figure 1, the results 
are encouraging. 

If we accept Equation 8 as repre- 
senting the relationship between valid- 
ity (and thus learning rate) and z, the 
theory can be used to explain the de- 
tailed results of guessing experiments 
reported by Estes and Straughan (7) 
and Jarvik (10). Estes and Straughan 
trained 16 subjects in each of two 
groups with reinforcement probabili- 


TABLE 2 





no=x/(1—) | 








Estimated V Predicted V 


— (Assumed to be 1.00)— 


860 | 640 





.260 360 





| 160 





.040 
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TABLE 3 


SUMMARY OF CALCULATIONS ON THE EstTES-STRAUGHAN 
AND JARVIK EXPERIMENTS 





Estes-Straughan Experiment 





Jarvik Experiment 


6 


Theoretical V | 


Total Ai 
Responses 
(Theoretical ) 


Total Ai 
Responses 
(Obtained ) 


95.8 





Total Ai 
Responses 
(Theoretical ) 


Total A: 
Responses 
(Obtained) 


60.0 


Theoretical V | 





ties of .85 and .30. Our computation 
is to estimate 6, the proportion of 
relevant cues, from the data of the .85 
group, and then assuming that 6 is 
the same for the .30 group and V de- 
pends on 7 according to Equation 8, 
‘compute the theoretical performance 
of the .30 group. Jarvik had three 
groups with reinforcement probabili- 
ties of .744, .663, and .593. The 
data of the .744 group are used to 
estimate 6, and then performance 
curves are predicted for the other two 
groups according to the theory. The 
detailed results of these computations 
are shown in Table 3. Inspection of 
the results shows that the theory is 
able to predict the total A; reponses in 
m trials with acceptable accuracy in 
these two experiments. 

Figure 2 summarizes the empirical 
accuracy of the proposed theory. The 
group with the highest reinforcement 
probability in each experiment is used 
to predict the results of the other 
groups, and theoretical expectations 


are compared with observed data. In 
each of the predictions the starting 
probability p(1) was estimated from 
the data of the group predicted. 
Other than this, the predictions are 
computed without reference to the 
performance of the group predicted. 
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forcement probability—theoretical hypothesis 
(Eq. 8) and empirical estimates. 
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response scores in four experiments with 
empirical observed means. 


DISCUSSION 


The theoretical formulation pre- 
sented in this paper has been shown 
to give rise to the same asymptotes as 


does Estes’ theory in simple prob- 
ability-learning situations, to have the 
same consequences as the writer's 
earlier discrimination learning theory 
when applied to such data, and to give 
a quantitative account of the relation 
between reinforcement probability 
and speed of learning in guessing 
experiments. 

As this theory describes learning, the 
probability of learning a particular 
cue depends on what other cues are 
present and on the correlations of 
those cues with reinforcements. This 
assumption is quite different from the 
approach taken by Estes and Burke 
(6), who suppose that each element 
has its own invariant sampling param- 
eter. If the Estes-Burke model is 
right, a complex problem can be pre- 
dicted theoretically as an assembly of 
stimulus elements, each with its own 
fixed parameter. Using the present 


theory, we consider a complex prob- 
lem as an assembly of elements which 
may appear in simpler problems, but 
the parameters of these elements de- 
pend on the context of the problem as 
awhole. While analysis of a problem 
as a whole seems at first too difficult, 
the results reported in this paper indi- 
cate that appropriate simplifying as- 
sumptions may make definite compu- 
tations feasible. 

The empirical results show that the 
present theory correctly describes the 
relation of learning rate to reinforce- 
ment probability. These results do 
not constitute evidence that the theory 
is correct, since the theory was formu- 
lated in the light of those results. In 
particular, the function relating valid- 
ity to reinforcement probability, Equa- 
tion 8, was selected because it seemed 
to fit the data. It is hoped that this 
formulation will stimulate the collec- 
tion of more precise parametric data 
regarding the rate of learning under 
conditions of probable reinforcement. 
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VALUE AND THE PERCEPTUAL JUDGMENT OF MAGNITUDE’ 


H. TAJFEL 


University of Durham, England 2 


Problems connected with the influences 
exercised by motivational factors on per- 
ception have been subject to much de- 
bate in recent years. It seems that the 
dust of controversy is beginning to set- 
tle. A reaction is setting in (e.g., 17) 
against premature attempts to construct 
theories which would cope with the 
enormous variety of facts in this field, 
even before many of the facts have been 
properly established. These general 
theories were based in part on an a 
priori assumption that ali the phenomena 
concerned can be reduced to, and for- 
mulated in terms of, a few very wide 
and nonspecific principles. No one can 
tell at present whether such an assump- 
tion is justified; but its hasty acceptance 
seems to have led to a confusion of ex- 
perimental issues which has not been 
avoided either by those who stressed the 
importance of motivational variables in 
perception or by those who persistently 
denied it. The primary concern of this 
paper is with only one of these issues: 
the phenomena of perceptual overesti- 
mation, presumably due to value or 
some related variables. One of the rea- 
sons for this concern is that there exists 
a body of positive experimental findings 
in this field which cannot be ignored, 
and for which a consistent explanation 
has not yet been offered. 

In the last fifteen years or so about 
twenty experiments on various aspects of 
overestimation have been conducted. Of 
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2 Now at the University of Oxford, Delegacy 
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those, only two have yielded unambigu- 
ously negative results (3, 21). Partly 
negative results have been reported by 
Carter and Schooler (8), and by Klein, 
Schlesinger, and Meister (19). All other 
workers were able to conclude that, in 
the situations which they were using, 
“motivational” or “value” variables had 
an effect on their subjects’ perceptual 
judgments of magnitude. Shifts in judg- 
ments of size (e.g., 1, 2, 5, 6, 7, 10, 20, 
26), weight (e.g., 9), number (e.g., 25), 
and brightness (13) have been reported. 

All this evidence cannot be dismissed 
as an accumulation of experimental arti- 
facts. Furthermore, perceptual accentu- 
ation need not be considered as a “mal- 
adaptive” phenomenon. The fact that 
it may represent a departure from the 
“objective reality” leads to a general 
criticism which consists in pointing out 
that in order to survive we must per- 
ceive the world as it is, that we usually 
do, and that therefore the fleeting phe- 
nomena of overestimation are more typi- 
cal of the specific laboratory situations 
in which they have been demonstrated 
than of perception under normal condi- 
tions. There is, however, a possibility 
that the shifts in the judgments of mag- 
nitude which concern us here do not 
interfere with an adequate handling of 
the environment. They may even be 
of help. 


A CLASSIFICATION 


The experiments on overestimation 
fall naturally into two classes. In one 
group, changes in the magnitude of the 
stimuli under investigation are relevant 
to the changes in value. The experi- 
ments on coins provide an example here: 
in general, the larger the coin, the greater 
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its value. The experiment of Dukes 
and Bevan (9), in which judgments of 
weight of jars filled with sand were com- 
pared with judgments of weight of jars 
filled with candy, would also be “rele- 
vant” in this sense: heavier jars would 
presumably contain more candy, and 
thus have greater “value.” On the other 
hand, several experiments have been re- 
ported in which changes in value have 
in no apparent way been related to 
changes in the physical dimension which 
the subjects were requested to judge. 
Thus, in the experiment by Lambert, 
Solomon, and Watson (20), the color of 
the disc was the determinant of its value, 
as red discs only were associated with 
reward; but judgments of size were re- 
quested of the subjects. In the experi- 
ments by Bruner and Postman (6) and 
by Klein, Schlesinger, and Meister (19), 
discs containing a swastika were among 
the stimuli used. Judgments of their 
size were compared with judgments of 


size of discs containing neutral symbols. 
Here again, the size of the swastika has 
no easily conceivable relationship to its 


degree of relevance. Further examples 
from both classes of experiments could 
be given. 


“RELEVANT” DIMENSIONS AND THE 
ACCENTUATION OF DIFFERENCES 


Let us first deal with the “relevant” 
group of experiments. The stimuli 
whose magnitudes were judged in these 
investigations form, by definition, a 
series varying concurrently in at least 
two dimensions: the physical dimension 
(size or weight), and the “dimension” 
of value. The concern of most experi- 
menters has been to show (or to deny) 
that the stimuli of the series in which 
a variation in magnitude was paired in 
the environment with a variation in 
value were judged larger than stimuli of 
objectively equivalent magnitudes be- 
longing to a different, “neutral,” series. 
Thus, judgments of size of coins were 
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compared with judgments of size of 
cardboard or metal discs, and judgment 
of weight of jars filled with candy with 
similar judgments of jars filled with 
sand. Little attention has been paid to 
the perceived differences between the 
magnitudes of the stimuli in the “val- 
ued” series, as compared with the cor- 
responding relationships in the “neutral” 
series. 

It seems that some, at least, of the 
apparent contradictions between the var- 
ious experimental results can be resolved 
if this intraserial aspect of the situation 
is considered. On the basis of an argu- 
ment to be developed later, a prediction 
can be made that in a “relevant” series, 
where value changes concurrently with 
the dimension subjected to investigation, 
the differences between the stimuli of 
the series will be perceived as larger 
than the objectively equivalent differ- 
ences between the stimuli of a neutral 
series, where no such association exists 
between value and magnitude. This 
comparison of the perceived differences 
within the two series would acquire its 
fullest meaning when _intraindividual 
data are considered. Large individual 
variations are to be expected in the fine- 
ness of discrimination for both relevant 
and neutral series, but the tendency for 
a larger accentuation of differences in 
the former is likely to appear in most 
subjects, independently of the absolute 
values of the differences. 

All the results to date in the “rele- 
vant” group of experiments support this 
contention. The quantitative difference, 
as expressed by subjects’ judgments, be- 
tween the two extremes of a series has 
been adopted as a rough measure of the 
accentuation of differences. This limita- 
tion was found necessary for two rea- 
sons. In the first place, coins, for ex- 
ample, differ not only in size and value. 
Secondly, it is possible that other de- 
terminants of quantitative judgment, 
such as the interaction of various gen- 
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TABLE 1 


(CONVERTED FROM CARTER AND 
ScHOOLER, P. 202)* 





Percentage Increase 
from Actual to 
Perceived Extension 


Stimuli 





Cardboard discs 
Aluminum discs 
Coins judged when present 
Coins judged from memory 


16.81 
16.20 
33.91 
38.69 








* Judgments of the “dime” and of the corresponding 
discs in other series have not been taken into account 
in this table, as the ‘‘dime"’ represents an exception in 
the concomitant variation of size and value in coins. 


eralization gradients (cf., eg., 18), 
might counteract in some segments of 
the series the expected accentuation of 
differences. 

A glance at Fig. 1 in the early study 
by Bruner and Goodman (5) shows 
that the perceived difference between 
the smallest and the largest coins (ex- 
tension of the scale of judgments) is 
much larger than the corresponding ex- 
tensions for discs. A conversion of the 
data in the experiment by Carter and 
Schooler (8, Table I, p. 202) to similar 
extensions, for all subjects combined, 
shows that: 


a. In all series of stimuli (coins, alu- 
minum and cardboard discs) the per- 
ceived extensions are larger than the 
actual ones; 

b. The relative differences between 
the actual and the perceived extensions 
are about twice as large for the series 
of coins as for both series of discs (see 
Table 1). 


Carter and Schooler imply this when 
they “ ... suggest that there is a con- 
stant error involved in making these size 
judgments such that small coins are 
underestimated and large coins are over- 
estimated in size” (8, p. 205); but they 
do not seem to draw further conclusions. 

Bruner and Rodrigues (7) have in- 
troduced in their well-controlled experi- 


TABLE 2 


(CONVERTED FROM BRUNER AND RODRIGUES, 
TABLE 6, P. 21) 


Percentage Increase from 
| Actual to Perceived 
: : Extension 
Stimuli | 


Value Set Accuracy Set 





73.7 
55.7 
39.8 


68.2 
34.9 
20.9 


Coins (on table) | 
Metal discs 
Paper discs 





ment the notion of “relative increase in 
overestimation” in which some of the 
suggestions made here are already im- 
plicit. Half of their subjects were as- 
signed to a “value set” in which the in- 
structions to the subjects emphasized the 
purchasing power of money. The other 
half were assigned to an “accuracy set” 
in which the experimenters’ concern with 
accurate judgments of size was stressed. 
A conversion of their data similar to 
the above conversion of Carter and 
Schooler’s data gives similar results (see 
Table 2). 

This table is suggestive in several 
ways: 


a. The same general tendency as in 
Carter and Schooler’s data emerges: the 
relative extensions for the value series 
are considerably larger than for both 
neutral series. 

5. All extensions in the “accuracy 
set” are larger than the corresponding 
extensions in the “value set.” This is 
not inconsistent with the general prop- 
osition put forward here that the rele- 
vance of a series of stimuli is related to 
an emphasis of the differences between 
them. It is reasonable to assume that 
an appeal for accuracy of judgment 
would lead to similar results in its stress 
on clear discrimination between the 
stimuli of the series. 

c. The differences between the exten- 
sions of the value and the neutrai series 
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are less marked in the “accuracy” than 
in the “value” set. This is also con- 
sistent with the position adopted in this 
paper. An appeal for accuracy would 
have little effect in a series associated 
with value, where the accentuation of 
differences between the stimuli exists 
already as a long-standing product of 
past experience (cf. McCurdy’s discus- 
sion of the role of schemata in the per- 
ception of coins, 22). This same appeal 
would have comparatively more effect 
in the case of a neutral series. 


One further example: Dukes and 
Bevan (9) found that for their “posi- 
tive” series (jars filled with candy) the 
variability of responses was less marked 
than for the “neutral” series (jars filled 
with sand). This is related to smaller 
j.n.d.’s in the positive than in the neutral 
series. In other words, the differences 


between the stimuli of the positive series 
were more clearly and consistently per- 
ceived than the objectively equivalent 


differences in the neutral series, and the 
scale of judgments was more extended 
for the positive than for the neutral 
series. 

This summary of evidence suggests 
that the results of the “relevant” ex- 
periments on overestimation cannot be 
solely the product of some simple and 
rather mysterious process of overestima- 
tion. As has already been pointed out, 
the interest of the experimenters was 
confined mainly to the comparisons of 
the perceptual judgments of stimuli in 
a value series with the judgments of 
physically equivalent stimuli in a neu- 
tral series. The subjects were invited 
to make comparisons between the valued 
stimuli and either a neutral standard or 
elements of a parallel neutral series 
(with the exception of the experiment 
by Dukes and Bevan, in which com- 
parisons between the stimuli of the value 
series were also included). However, 
the implications of the fact that the 
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stimuli associated with value do form a 
series cannot be ignored; during the ex- 
periments the subjects were repeatedly 
exposed to the various elements of this 
series, and the “belongingness,” in the 
case of coins, would be further enhanced 
by familiarity. 

In view of the evidence concerning the 
effects that all elements of a series, past 
and present, have on the quantitative 
judgments of its individual members 
(e.g., 14, 18), it may reasonably be as- 
sumed that the judgments of magnitude 
given by the subjects in the “relevant” 
overestimation experiments were not 
only determined by the perceived rela- 
tionship, at the time of judgment, be- 
tween a stimulus of the value series and 
a standard. They must have been af- 
fected as well by the background of per- 
ceived relationships between this par- 
ticular stimulus and all other stimuli of 
the same series. This assumption is 
further supported by the evidence that 
the effects exercised by a particular stim- 
ulus on judgments pertaining to a series 
of stimuli increase as a function of the 
extent to which this stimulus is per- 
ceived as forming part of the series 
which is being judged (4). 

In other words, in the “relevant”’ ex- 
periments on overestimation two aspects 
of the situation must be taken into ac- 
count: the interserial and the intraserial. 
The first consists of the perceived rela- 
tionships of magnitude between any 
stimulus of the value series and the 
neutral stimuli; the second is concerned 
with the perception of relationships be- 
tween the stimuli of a value series as 
compared with the corresponding rela- 
tionships in an objectively identical neu- 
tral series. 

The phenomenon of accentuation of 
differences between the stimuli in a value 
series can be isolated in its “pure” form 
when the following two requirements are 
satisfied: the subjects’ judgments must 
be based on comparisons between the 
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various stimuli of this series, and not 
between these stimuli and some extrane- 
ous ones; and the value and neutral 
series must be objectively identical, value 
being the only experimental variable in 
which they differ. These requirements 
were satisfied in some experiments re- 
cently completed by the writer (27).° 
An ordered series of ten weights was 
used, and the subjects were requested to 
judge their heaviness in terms of seven 
category numbers. Each subject under- 
went an equal number of sessions under 
the “value” and under the “neutral” con- 
ditions. An experimental session con- 
sisted of two parts: in the first part, the 
entire series was presented several times 
in random order, but no judgments were 
reported by the subject. In the second 
part, following the first after an interval 
of about three minutes, all the stimuli 
were presented again several times, and 
judgments of weight were requested at 
each presentation. In the value condi- 


tion, a small paper bonus (gift certifi- 
cate, exchangeable for a book) accom- 
panied each presentation of one of the 
two heaviest (or two lightest) stimuli 
of the series during the first part of the 


session. No rewards were given in the 
first part of the neutral sessions. In 
this way, the effects of the two experi- 
mental conditions, introduced in the 
first part of the sessions, on judgments 
of weight in the second part could be 
assessed. A total of 60 adult subjects 
were used in the four experiments. They 
were told, as part of the instructions, 
that the purpose of the experiment was 
to investigate the effects of monotony on 
the speed of performing a simple task 
which consisted of discriminating be- 
tween weights. The “small paper to- 
kens,” which they were to receive from 
time to time, were being introduced in 
order “to vary the degree of monotony.” 


8 This investigation was supported by a 
grant from the Durham Colleges Research 
Fund. 
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A questionnaire, which was presented to 
the subjects after completion of all the 
experiments, revealed that none of them 
doubted the truth of these statements. 
The subsequent disclosure of the ex- 
perimenter’s bad faith caused consider- 
able surprise. 

Data consisted of differences in the 
extensions of the scales of judgment be- 
tween the two conditions for each of the 
subjects. Results can be briefly sum- 
marized as follows: in the first two ex- 
periments, in which rewards were as- 
sociated with the two heaviest and the 
two lightest stimuli of the series, respec- 
tively, extensions of the scales of judg- 
ments were significantly larger for the 
value than for the neutral condition. In 
a third experiment, in which rewards 
were associated indiscriminately with 
all the stimuli of the series, no such ef- 
fect occurred. In a fourth experiment, 
the procedure was the same as in the 
first and second experiments, apart from 
the fact that the paper “bonuses,” passed 
by the experimenter to the subjects at 
each presentation of either one of the 
two heaviest or one of the two lightest 
stimuli, were devoid of all value. Once 
again, no significant effect on the ex- 
tension of the scale of judgments was 
observed. 

All this evidence is, to say the least, 
strongly suggestive. The advantages of 
using accentuation of differences as an 
explanatory device for overestimation are 
threefold: 


a. It accounts for some seemingly 
contradictory results, such as the under- 
estimation of the small end of the value 
series reported in some studies (7, 8); 
it also accounts for Bruner and Rod- 
rigues’ (7) “relative overestimation.” 

5. It does not require an introduction 
of deus ex machina principles to ac- 
count for the phenomenon of overestima- 
tion in the “relevant” class. 

c. With some extensions, it can be ap- 
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plied to the “interserial” differences in 
the “relevant” experiments, and to most 
experiments on overestimation which fall 
outside the ‘“‘relevant”’ class. 


The first point above does not require 
further elaboration. The second can be 
supported by some lines of argument, in- 
dependent of each other. 

McCurdy (22) has recently pointed 
out that by exaggerating the differences 
between the various coins, memory com- 
mits a “good error.” In his ingenious 
attempt to relate the results of the 
studies on coins to the concept of sche- 
mata, he limited his discussion to series 
of coins where strong schemata could 
have developed through long familiarity 
with the stimuli. However, this would 
not account for the results of those ex- 
periments in which accentuation of dif- 
ferences in the value series, as compared 
with the neutral series, was found to 
exist (9, 27), in spite of the fact that 
the conditions for thé development of 
the schemata were, as far as one can 
judge, not notably different for the two 
series. 

What is the nature of the experience 
with coins, or with any series of stimuli, 
where it is important to discriminate 
sharply between the elements of the 
series? Discriminative responses to 
stimuli in such series are not usually 
made in terms of precise quantitative 
labels attached to individual stimuli. 
They are made in terms of “larger than,” 
or “smaller than,” the neighboring ele- 
ments of the series. Minimizing the 
differences entails a risk of confusion; 
accentuating them is an additional guar- 
antee of a successful response. How- 
ever, the normal routine of responding 
in terms of “larger” or “smaller” is up- 
set in most experiments on overestima- 
tion. The usual technique for correct 
handling of coins, for example, which is 
based primarily on an awareness of the 
relevant differences between a particular 
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coin and other coins of the series, pres- 
ent or absent, is not quite adequate, as 
unusually precise individual quantitative 
labels are requested in the experiments. 
These absolute labels, whether obtained 
through matching or through some kind 
of verbal categorizing, may be expected 
to reveal an accentuation of differences 
between the stimuli, since they would 
reflect, and possibly exaggerate, what is 
otherwise implicit in the relative judg- 
ments of comparison pertaining to the 
series. 

It is in this sense only that the re- 
sults yielded by the studies on over- 
estimation can be treated as experi- 
mental artifacts. In this context, it is 
not particularly important to find out 
whether the stimuli are really “seen” as 
larger or smaller. They are reproduced 
as such; to ask the subject to match a 
variable standard to a stimulus, or to 
assign to the stimulus a quantitative 
verbal label, is essentially asking him to 
reproduce its size. This reproduction, 
which involves an activity very different 
from stating vaguely that an object is 
sizably larger or smaller than some- 
thing else, lends itself easily to a sharp- 
ening of the relevant distinctive feature 
of it, which is, in this instance, its dif- 
ference in size from the next object in 
the series. The phenomenon is not un- 
familiar: Gibson reported some time ago 
(12) that a sharpening of differences 
occurred in the early stages of aircraft 
recognition training, when his subjects 
were asked to draw the silhouettes of the 
various aircraft. 

A prediction is possible here to the 
effect that shifts in the estimates of 
magnitudes would either not occur or 
promptly disappear if the training to 
discriminate between the elements of 
a series, along which discrimination is 
important for the subject, was directed 
towards the accuracy of individual quan- 
titative labels rather than towards a 
clear distinction in relative terms be- 
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tween the stimuli. This is exactly what 
happened in an experiment by Smith, 
Parker, and Robinson (25) in which 
accuracy of report was a condition of 
obtaining the reward. Prizes were of- 
fered to the subjects who would report 
correctly the greatest number of dots 
forming clusters which were flashed suc- 
cessively on a screen. In a control 
group, prizes went to those subjects who 
reported correctly on the number of 
dots in. the greatest number of clusters. 
No information about the accuracy of 
their estimates was available to the sub- 
jects during the experiment. The first 
group showed overestimation in the 
early stages of the experiment, but after 
a certain number of trials the perform- 
ance of both groups converged. 

An experiment is being designed at 
present to test a further implication of 
the present argument: that in any or- 
dered series, training to discriminate in 
comparative terms between the stimuli 
will result in larger estimated differences 
between the elements of the series than 
training to label the stimuli quantita- 
tively in familiar units of measurement. 

These assumptions of a “functional” 
mechanism underlying the accentuation 
of differences may be supplemented by 
some evidence coming from a different 
quarter. It consists of the findings about 
the effects of multidimensionality on the 
acuity of discrimination along a series. 
Eriksen and Hake (11) have reported 
recently that when the method of ab- 
solute judgments is used, the number of 
discriminable steps for a series of stim- 
uli is greater when the stimuli vary con- 
currently in two or more dimensions 
than when they vary in one dimension 
only. More specifically, they found that 
discriminability was considerably greater 
when the “stimuli varied in size and hue, 
size and brightness, hue and brightness, 
and size, hue, and brightness” (p. 159) 
than when they varied in only one of 
these dimensions. Eriksen and Hake 
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suggest that their results were due to a 
kind of summation, to “the ability of Ss 
to make fairly independent judgments of 
stimulus values along each of the com- 
They 


ponent dimensions” (p. 158). 
add, however: 


We cannot assume that this is always the 
case for compound stimuli. For some stimuli, 
judgments made of values in the separate di- 
mensions may be interrelated. That is, the 
evocation of a particular response tendency 
by a component of a compound stimulus may 
change the likelihood of evocation of other re- 
sponse tendencies by other components of the 
stimulus. We could expect this to occur when 
Ss have learned by long experience to associate 
the occurrence of certain values in one dimen- 
sion with the occurrence of particular values 
in another. 


This interrelation may account in 
part for such perceptual “errors” as the 
size-weight illusion. Ryan (23, quoted 
by Hochberg, 15) writes: “ ...O is not 
really reporting weight at all, but spe- 
cific gravity, and the illusion is really 
not an illusion at all. O is simply unable 
to report on weight being influenced by 
the striking differences in density in- 
volved.” In the field of size estimation, 
Holzman and Klein (16) have recently 
reported that heavy discs were judged 
larger by their subjects than light discs 
of identical size, and that black discs 
were judged smaller than grey discs of 
identical size and weight. They add 
wistfully that these results “ .. . do 
not exclude the possibility that value 
and need may indeed be important vari- 
ables in size judgments” (p. 40). 

Two conclusions can be drawn from 
the above: the first, supported by Erik- 
sen and Hake’s results, is that under 
some conditions of judgment, and in a 
situation new to the subject, compound- 
ing of concurrently varying dimensions 
will result in a clearer perception of dif- 
ferences between the stimuli along a 
series. The second is related to the 
first, and already outlined in the above 
e4otation from Eriksen and Hake: when 
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the association between dimensions has 
been a long-standing one, new training 
may not be capable of inducing changes 
in discriminability. In such cases, judg- 
ments along one of the dimensions have 
already been influenced, previous to the 
experiment, by concomitant changes in 
other dimensions. In other words, it is 
likely that the Eriksen and Hake situa- 
tion represents the incipient stages of a 
process which, if given an opportunity to 
develop further, might lead to interrela- 
tions such as those that Ryan suspects 
to be responsible for the size-weight il- 
lusion, or that Holzman and Klein (16) 
found between the estimated size and 
the actual weight of their discs. Inter- 
relations of this kind exist, of course, 
when there is a consistent pairing, in the 
environment, of changes in more than 
one dimension. 

Such pairing exists by definition in 
series where magnitude and value vary 
concurrently. The difficulty is that 
value is not a “dimension” in the physi- 
cal sense. It is, however, an important 
attribute of the stimuli in such series, if 
only because efficient discrimination be- 
tween the stimuli in terms of differences 
in value is, in most cases, more important 
than discrimination in terms of the 
physical dimension. It is not reasonable 
to expect that subjects who, in an ex- 
periment, judge a series of coins in 
terms of size, judge it in terms of size 
alone. This would be a feat of ab- 
stract behavior difficult for adults, and 
even more so for children. As Vernon 
(28) points out in her discussion of 
the experiment by Bruner and Good- 
man (5): “It seems fairly certain that 

. there was an inability to isolate 
a single aspect from a global percept” 
(p. 187). Or, as Hochberg writes in 
another context (15): “ ... the sub- 
jects are responding in terms of a di- 
mension which specifies the most about 
the stimulus series with the least num- 
ber of categories, rather than in terms 
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of the dimension expected by the ex- 
perimenter.” 

In the experiment on weights de- 
scribed earlier (27), value is the only 
“dimension of difference’ between the 
two series. It would also be a supple- 
mentary and important difference be- 
tween two series which differ not only 
with regard to value, but also with re- 
gard to some other dimensions. In the 
first case, the difference in value is the 
only contributing factor to a more pro- 
nounced accentuation of differences in 
the value than in the neutral series; in 
the second, it is one of the contributing 
factors. Once again, this does not neces- 
sarily mean that the subjects “see” the 
stimuli in the value series as being more 
different from each other, or larger, or 
smaller. They respond as if they per- 
ceived them as such, as this is as far 
as the phenomenology of it can go for 
the time being. 


“TRRELEVANT” DIMENSIONS AND THE 
ACCENTUATION OF DIFFERENCES 


The accentuation of differences be- 
tween the stimuli of a value series may 
partly account for the results obtained 
in the “relevant” experiments on over- 
estimation. However, it has no direct 
bearing on the problems raised by the 
results of the “irrelevant” group of ex- 
periments; nor does it help to under- 
stand why in the “relevant” group a 
rather consistent trend towards over- 
estimation of the value series should be 
superimposed, as it were, on the sharpen- 
ing of differences within it. 

The preceding discussion is not, how- 
ever, entirely irrelevant to these prob- 
lems. As was pointed out earlier, in a 
“relevant” series the subjects’ judgments 
are simultaneously determined by the 
perceived differences between the stim- 
uli of the series, and between a par- 
ticular stimulus of the series and an 
extraneous stimulus. In the “irrelevant” 
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experiments, where the magnitude under 
investigation seems in no way related to 
the presence or absence of value, the 
fact that one stimulus is “valued” and 
another “neutral” would form an addi- 
tional distinctive feature between them, 
superimposed on the physical differ- 
ences. Examples of such experiments 
have been given above (6, 19, 20). 
Beams’ study (2) on overestimation of 
size of favorite food objects by children 
provides another example. The com- 
mon feature of these experiments is that 
the subjects are requested to judge along 
a dimension which, in usual circum- 
stances, does not help to discriminate 
between the objects in terms of their 
value. Children, who in the experiment 
by Lambert et al. (20) were asked to 
report on the size of the chips, had just 
experienced a situation in which the 
value of the chips was determined by 
their color. The refugees, who in the 
study by Klein et al. (19) were judging 
the size of discs containing the swastika, 


would be unlikely, in any other circum- 
stances, to pay special attention to the 
size of the emblem. 

In these experiments the valued and 
the neutral objects belong to sharply 
distinctive categories of experience, and 
size is certainly not the basis of classi- 


fication. In such cases, to judge size 
exclusively on the basis of size would, 
once again, be possible only if the sub- 
jects were able to abstract one element, 
and not an important one, out of a com- 
plex compound of experience. A more 
likely assumption is that the qualitative 
“motivational” differences between the 
two kinds of stimuli function in the di- 
rection of accentuating the perceived 
differences between their “irrelevant” 
magnitudes. 

This would lead to the prediction that, 
all else constant, the perceived differ- 
ences of magnitude between valued (or 
otherwise relevant) and neutral stimuli 
would be larger than the corresponding 
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differences between two sets of neutral 
stimuli. It does not, however, allow for 
the prediction that the valued stimuli 
would be perceived as larger than the 
neuiral ones. 

Some interesting indications which 
may suggest a way out of this difficulty 
can be found in the experimental litera- 
ture. In the group of studies on “ir- 
relevant” magnitudes, three experiments 
seem typical of those which yielded 
negative (or partly negative) results. 
Klein e¢ al. (19) reported that their 
subjects did not reliably overestimate 
the discs containing the swastika. Bevan 
and Bevan (3), who used a “quasi-rep- 
resentative design” and a heterogeneous 
assortment of objects with their two 
children, found that the ratio of esti- 
mated to real size was not higher for the 
liked than for the disliked or neutral ob- 
jects. Finally, Lysak and Gilchrist (21) 
found that increasing complexity of de- 
sign was a significant determinant of 
increasing overestimation; but dollar 
bills were not judged larger than the 
corresponding rectangles, nor was there 
any correlation between overestimation 
and value of various dollar bills. Ex- 
periments by Lambert ef al. (20) and 
by Beams (2), already referred to, are 
representative of those in which positive 
results were reported. 

The experiments of Klein e¢ al. (or 
at least the parts of them involving the 
swastika) and of Bevan and Bevan (3) 
have one feature in common: the value 
of the objects used has nothing to do 
with their size. The data, as they are 
presented, do not allow one to draw 
conclusions about an accentuation of dif- 
ferences between the sizes of valued and 
neutral stimuli; but if there was such an 
accentuation, there are no grounds to 
assume on the basis of the present argu- 
ment that it should work, with any de- 
gree of consistency, in the direction of 
overestimating the valued objects. 

On the other hand, it seems reasonable 
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to say that the dictum “the bigger the 
better” expresses a profound truth to 
Beams’ 10- to 12-year-old Ss when they 
are confronted with a piece of fruitcake, 
a sugar cookie, or a “marshmallow- 
covered chocolate cupcake” (2, p. 197). 
Lambert ef al. worked with much 
younger children (3 to 5 years old); 
the number of chips of a particular color 
which they received was related to the 
amount of candy which was the final 
compensation for their efforts. It would 
not be surprising if this, in view of the 
age of the subjects, had a great deal to 
do with their judgments of size. 

The negative results of Lysak and Gil- 
christ (21) lend further support to the 
thesis that the accentuation of differ- 
ences will tend consistently towards an 
overestimation of valued objects only 
when there exists a valued series in 
which there is some discernible rela- 
tionship between magnitude and value. 
Their adult subjects knew from long 
experience that the value of dollar bills 
does not vary as a function of their size. 
They may have accentuated the differ- 
ences in size between the bills and the 
corresponding rectangles, but not neces- 
sarily or consistently by overestimating 
the bills more than the control stimuli. 

A comparison of this study with other 
experiments performed on adult subjects 
helps to bring out more clearly its par- 
ticular features which may account for 
the lack of positive results. In the ex- 
periment by Klein ef al., there is no 
tendency to overestimate the disc bear- 
ing the swastika; but in one of their 
two experimental situations, all results 
pertaining to the disc bearing the dollar 
sign are significant (19, Table 3, p. 103). 
This is further substantiated by Solley 
and Lee (26) who attempted “ .. . to 
determine whether the differences in per- 
ceived size of discs with symbols drawn 
upon them, as reported by Bruner and 
Postman, could be better explained by 
their hypothesis of symbolic value than 
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by the more general Gestalt principle of 
closure” (p. 142). They report an over- 
estimation of the discs containing a 
dollar sign (as compared with discs con- 
taining a figure of presumably equal de- 
gree of closure), but no such effect with 
the discs containing a swastika. The 
fact that a symbol of money drawn on 
a disc is strongly suggestive of the 
familiar series of coins, whose value in- 
creases with their size, may well be 
responsible for these results. 

The negative outcome of the experi- 
ment by Lysak and Gilchrist must also 
be taken in conjunction with an almost 
identical experiment by Dukes and Bevan 
(10), who reported positive results. 
They used a number of rectangular 
cards, identical in size but differing in 
value, and engaged in “gambling” with 
their subjects. A number representing 
its value was stamped on each of the 
cards. The cards were drawn at random 
from a bag, and at each draw the sub- 
ject won or lost an amount of money 
proportional to the positive or negative 
number printed on the card. As soon 
as each drawn card was returned to the 
bag, the subject was asked to match its 
size with one of a series of blank “refer- 
ence” cards spread out in front of him. 
Dukes and Bevan’s hypothesis was that 
“as the monetary value printed on the 
test card increases from zero, the sub- 
ject, regardless of winning or losing, will 
tend to select larger reference cards as 
his estimates” (10, p. 47). This was 
confirmed: a significant correlation 
emerged between the ratios of estimated 
to actual sizes and the monetary value, 
positive or negative, of the test cards. 

The situation was almost identical 
with one of the arrangements of Lysak 
and Gilchrist’s experiment (21). If the 
fact of conflicting results were to be 
ascribed, here or in other cases, to rela- 
tively unimportant differences in pro- 
cedure, attempts to generalize results 
from a great number of psychological 
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experiments would have to suffer a seri- 
ous setback. There is, however, one 
essential difference between the two ex- 
periments: Lysak and Gilchrist’s sub- 
jects knew that all the dollar bills, with 
which they were presented, were of 
equal size. This bit of information 
about the cards used was not available 
to Dukes and Bevan’s subjects. What 
is more, Dukes and Bevan included in 
their instructions the following sentence: 
“Here is a bag containing small white 
cards, varying slightly in size” (10, p. 
44, italics mine). It would be only 
natural for the subjects to assume that 
this variation followed some regular pat- 
tern, and that this pattern consisted of 
an increase in size corresponding to the 
increase in value. Here again, a con- 
tinuous variation along the dimension 
of value has carried over to the judg- 
ments of the physical dimension, and 
this has been further helped by a hint 
from the experimenter. 

But there is some evidence that the 
same could have happened even with- 
out the hint. In an experiment by Ash- 
ley, Harper, and Runyon (1), a “gray- 
ish metal slug” was presented several 
times to the subjects. At different pres- 
entations of the slug they were told that 
“it was made of either lead, silver, white 
gold, or platinum” (p. 568). Judgments 
of the size of the slug reflected its 
imaginary value: the size of the variable 
standard “that was called equal to the 
slug increased as the cost of the metal 
increased” (p. 572). As in the pre- 
ceding example, a progressive increase 
in value is related to an imaginary in- 
crease in size, while the actual size is 
kept constant. 

The evidence summarized so far lends 
strong support to the view that both the 
positive and the negative results of the 
studies on overestimation can be best 
understood in terms of a hypothesis of 
accentuation of differences. This ac- 
centuation would work along two axes: 
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between the stimuli of a series in which 
there is a concomitant variation of value 
and magnitude; and between the valued 
stimuli and the neutral ones. In the 
latter case, the accentuation would tend 
towards a relative overestimation of the 
valued stimuli only under certain con- 
ditions, which have been discussed 
above. 


SocIAL PERCEPTION 


Some implications of the present dis- 
cussion go beyond the problems raised 
by the phenomenon of overestimation. 
It may be said that, in a sense, “‘over- 
estimation” as discussed in this paper 
is a special case and a convincing ex- 
perimental paradigm of a more general 
aspect of social perception. Many so- 
cial objects and events are sharply clas- 
sified in terms of their value or rele- 
vance. When judgments concerning 
some quantifiable or ratable aspects of 
stimuli which fall into distinct categories 
are called for, differences in value or rele- 
vance cannot fail to influence the quan- 
titative judgments in the direction of 
sharpening the objectively existing dif- 
ferences between the stimuli. A very 
similar conception has recently been 
formulated by Hochberg (15), and ap- 
plied by him to the perception of in- 
and out-group individuals: 


If a group of individuals is perceived as 
different from the non-group individuals, the 
perceived differences between those within the 
group and those outside the group will auto- 
matically be sharpened, and the differences 
perceived between the members of the group 
(ie., intragroup differences), and between 
those outside the group will be lessened. 


These judgmental effects of categor- 
ization are probably fairly general; it is 
likely, however, that they are particu- 
larly pronounced when judgments are 
made in dimensions in which scaling in 
magnitude is simultaneously a scaling 
in value. Thus, it may well be that an 
accentuation of differences in size will 
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hardly occur between two paintings, one 
liked and one indifferent or disliked. 
But when skin color, or height, or some 
facial traits of social “value” are con- 
cerned, there will be a marked sharpen- 
ing of differences in the degree of these 
characteristics perceived as belonging 
to individuals who are assigned to dif- 
ferent categories. Some evidence of this 
is provided by a recent study on “per- 
ceptual accentuation and the Negro 
stereotype” conducted by Secord, Bevan, 
and Katz (24). Their results suggest 
that a group of prejudiced Ss sharpened 
the differences in the degree of Negroid 
physiognomic traits possessed respec- 
tively by Negroes and whites, more than 
did a group of nonprejudiced Ss. It is 
likely that the same is happening in the 
case of more abstract social judgments 
which are implicitly quantitative, such 
as, for example, those concerning the 
relative frequency of crimes in various 
social groups, as perceived by people 
who have an axe to grind. 


SUMMARY 


The thesis put forward in this paper 
is that the apparently conflicting results 
of studies concerned with perceptual 
overestimation can be best understood 
if this phenomenon is considered as a 
special case of accentuation of perceived 


differences. This would work in two 
directions. First, in a series of stimuli 
in which there exists a concomitant vari- 
ation of value and of some physical 
dimension, the perceived differences be- 
tween the elements of the series would 
be larger than in a corresponding neutral 
series. Secondly, the perceived differ- 
ences in magnitude between the stimuli 
which possess the attribute of value and 
those which do not would tend to be 
more accentuated than the correspond- 
ing differences between neutral stimuli. 
This would not, however, always result 
in overestimation. The sharpening of 
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differences resulting in overestimation of 
valued stimuli would occur only when, 
in the class of stimuli with the attribute 
of value, an increase in some physical 
magnitude is correlated with the increase 
in value. In other cases, there are no 
grounds for assuming that an accentua- 
tion of perceived differences between the 
two classes of stimuli would tend con- 
sistently towards an overestimation of 
the valued stimuli. 

In this way, overestimation is seen as 
one instance of a more general phe- 
nomenon. The accentuation of differ- 
ences between classes of stimuli occurs 
when these stimuli differ in some respects 
other than the dimension along which 
the subjects are reporting their judg- 
ments of quantity. The presence or 
absence of “value” or “relevance” is one 
such contrast. It is probable that this 
contrast is responsible not only for the 
results obtained in the field of over- 
estimation. It may also lead to the 
sharpening of differences between the 
quantifiable or ratable aspects of social 
objects and events. This occurs when 
the dimension along which judgments 
are made is not the primary basis for 
assigning the various stimuli into their 
sharply distinctive categories. 
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AND IMMEDIATE MEMORY * 
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Many people have a natural distaste 
for model building. A mechanical model 
is essentially a theory expressed in ma- 
terial parts rather than in abstract sym- 
bols such as words or mathematical ex- 
pressions. Its logical standing is the 
same: that is, it stands or falls by the 
degree to which it fits the results of ex- 
periments on human or other animals. 
Yet many models in the past have been 
somewhat undistinguished in the close- 
ness with which experiment has been 
considered in their design. It is diffi- 


cult to avoid feeling that this is because 
a model is unduly laborious to build as 
compared with a verbal theory, so that 
the builder tends to become obsessed 
with the properties of his model rather 


than those of the organism. Conse- 
quently there is much to be said for 
building theories verbally, and especially 
for using the qualitative terms of infor- 
mation theory in the hypothetico-deduc- 
tive fashion ably set out by Mackay 
(29). Such an approach has certain 
advantages over the other popular al- 
ternative of quantitative S-R terms; 
these advantages have been considered 
elsewhere (11). 

For example, the writer holds that the 
human perceptual system has a limited 
capacity, that in consequence a selective 
operation is performed upon all inputs 
to the system, and that this operation 
takes the form of selecting all inputs 
having some characteristic in common. 


1 This work was supported by the British 
Medical Research Council, and the writer 
works under the general direction of Dr. N. H. 
Mackworth. He has discussed the topic with 
profit with many individuals; Dr. J. Brown 
should be especially mentioned. 


Such an operation extracts little informa- 
tion from the signal and thus should be 
economical of nervous mechanism. Char- 
acteristics on which the selection can 
operate may be named “sensory chan- 
nels.” The particular selection made at 
any one time will depend partly on char- 
acteristics of the input itself (physical 
intensity, earliness in time, absence of 
recent inputs on that channel, position 
of the channel in the hierarchy of all 
channels) and partly on information in 
a more permanent store. The change 
from one selection to another will take 
a determinate time. 

Incoming information may be held in 
a more temporary store at a stage previ- 
ous to that of the selective operation. 
Such information will pass through the 
perceptual system on the next subse- 
quent selection of the sensory channel 
of its arrival, if it is still in store; but 
the probability of the latter condition 
being fulfilled will decline with time 
spent in store. After passage through 
the perceptual system, information may 
be returned to the same temporary 
store, the selection of information for 
such return being determined by infor- 
mation in a more permanent store. 

But although such a purely verbal 
theory may fit experimental results, it 
is difficult to communicate to others 
without putting them to the trouble of 
learning the necessary vocabulary. And 
if the theory is rephrased, still ab- 
stractly, it is open to misinterpretation; 
thus Deese (19), in a paper which 
makes a number of valuable contribu- 
tions to the theory of prolonged work, 
has described the writer as postulating 
an inhibitory construct in human per- 
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formance. This misunderstanding prob- 
ably arises from the fact that the the- 
ory outlined in the last two paragraphs 
is intended to apply both to condition- 
ing and to human watch keeping; in 
neither case is it thought necessary to 
find an inhibitory construct of orthodox 
learning theory type. To say that con- 
ditioning and perception are related is 
not to apply a particular interpretation 
of the former to the latter. 

Clearly, then, some sort of expository 
device is needed for an abstract theory 
using unfamiliar terms. And it is even 
difficult for the theorist to remember in 
abstract form the results of the many 
different experiments which a good the- 
ory should consider. A simple mechani- 
cal model has the virtue of avoiding 
these difficulties. It has other vices: it 
may have accidental properties which 
mislead research. Perhaps the best com- 
promise is to state a theory in abstract 
terms, and also to give a model which 
can be described by the same verbal 
theory. Information concepts are ap- 
plicable to ary system, whatever its 
physical nature, and so may equally fit 
a model or a man. This is the ap- 
proach of Deutsch (20, 21), and it 
has real advantages, independent of the 
value of his particular theory. The 
present paper is therefore intended to 
describe an extremely simple model of 
the human perceptual system. It may 
serve both as an easy introduction to 
the formal theory in information flow 
terms and also as a convenient mne- 
monic for the results of a number of 
experiments. 


Tue Bastc MopEL 


The necessary requirements are a Y- 
shaped tube (Fig. 1) mounted verti- 
cally, and a set of small balls. Each 
ball bears a number so that all are in- 
dividually recognizable. The Y tube 
has a narrow stem which will just take 
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Fic. 1. The simple model for attention. 


only one ball, though the branches are 
wider. At the junction of stem and 
branches is a hinged flap which nor- 
mally hangs straight downward, but 
which can be pivoted about its upper 
edge so as to close off either of the 
branches of the Y. This pivoting can 
be done by a handle from outside the 
tube, controlled by stored information; 
purists may wish to control the handle 
by a punched-tape system, but a human 
being is an adequate substitute. When 
the handle is left alone the flap moves 
freely so that a ball dropped into one 
arm of the Y will knock the flap aside 
and fall into the stem of the Y. 

In this model the balls represent the 
information from various stimuli. The 
branching arms represent different sen- 
sory channels; thus one might be one 
ear and one the other ear. Alterna- 
tively, one might be the ear and one the 
eye. (Sensory channel is not, however, 
quite equivalent to sense-organ, since we 
would treat sounds localized in different 
places as being on different channels.) 
The bottom of the Y represents a re- 
sponse output, so that the process of 
dropping a ball into the arms and ob- 
serving its emergence at the bottom is 
analogous to that of delivering a stimu- 
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lus and observing a response. The be- 
havior of the model resembles that of 
man in the following ways: 


(a) If two balls are dropped simul- 
taneously, one into each of the branches, 
they will strike the flap on both sides; 
it will not move and therefore they will 
jam in the junction. Numerous experi- 
ments show this “distraction” effect, but 
there are certain advantages in citing an 
auditory experiment (5, 33). These 
same experiments show that if the 
handle is used to shut off one branch 
before the balls are inserted, then the 
ball entering the other branch will 
emerge successfully, which is analogous 
to previous instructions. 

(6) If the two balls are not strictly 
simultaneous, the first to arrive will ob- 
tain an advantage by knocking the flap 
over and shutting out the other. This 
had been shown to be analogous with 
competing auditory stimuli by Spieth, 
Curtis, and Webster (36). 


(c) If the Y is not perfectly vertical, 
the ball in the more vertical branch will 
have an advantage over a simultaneous 
ball in the other because the door will 


hang to one side. Equally, one sensory 
channel may have an advantage over 
another, as has been shown for high- 
pitched noise as compared with low 
(10). 

(d) If one ball is flung violently 
down its branch, it may succeed in 
forcing over the door against the un- 
assisted weight of a ball on the opposite 
branch. Equally, an intense stimulus 
may have an advantage (2, 10). 

(e) After a single ball has been 
passed through the system, the door will 
swing back from the position into which 
it has been pushed. Naturally it will 
overswing, and temporarily close the 
branch which has just been used. A 
stimulus has similarly an extra advan- 
tage for response if it comes on a previ- 
ously quiet channel as opposed to a 
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previously busy one. This has been 
shown by Poulton (34) for auditory 
signals, and a related finding is that of 
Hyman (25) for visual reaction times. 
In the latter case, stimuli of different 
frequencies of occurrence were delivered, 
and it was found that average reaction 
time to a set of stimuli was proportional 
to the information conveyed by them; 
but the infrequent signals, while giving 
long reaction times, did not give times 
as long as those to be expected from 
information theory calculations. Note, 
however, that the time taken for a swing 
is important in the model; this will 
probably also be true in man, to judge 
from data which are best considered be- 
low. It may also be connected with Hy- 
man’s finding that, for numbers of al- 
ternatives greater than two, the second 
of two identical signals received an un- 
duly fast reaction. 

(f) If a given number of balls are to 
be put through the tube, it is best to 
deliver them asymmetrically, the ma- 
jority to one branch. There is then less 
risk of jamming than if equal numbers 
are admitted by both branches. The 
analogous finding for auditory messages 
has been reported by Webster and 
Thompson (37). This point is related 
to another, that if balls are being~ in- 
serted into one branch at random in- 
tervals the effect of increasing the rate 
of delivery of balls through that same 
branch (the “speed” of work) is not the 
same as that of adding the same num- 
ber of extra balls to the other branch 
(the “load” of work). There is more 
risk of jamming in the latter case. This 
point was first clearly emphasized by 
Conrad, using visual signals (16). Ob- 
viously the effect of using two branches 
rather than one will be more serious if 
the rate of delivery of balls is high, 
since this increases the probability of a 
jam. Conrad showed such an interac- 
tion of speed and load. Mackworth and 
Mackworth (28) have shown a similar 
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effect in a different type of task, and 
have demonstrated that the fluctuating 
difficulty of this complex task at any in- 
stant can be represented by the amount 
by which each signal is overlapped by 
other simultaneous ones. 

At this point we may pause to con- 
sider the cynical reader who is wicked 
enough to be doubtful of the existence 
of a Y-shaped tube somewhere in the 
region of the thalamus. Such a reader 
will probably have noticed already a 
situation which will follow when: two 
balls are dropped into the model and 
jamming is avoided by the door being 
to one side for one of the reasons listed. 
The favored ball will descend the stem 
of the Y, but the impeded ball will 
not therefore disappear. It will emerge 
later, when the door next swings back 
to the opposite branch. Surprisingly 
enough, this also happens with man. 
Simultaneous stimuli either jam or pro- 
duce successive responses.? It has been 
shown for three different combinations 


of sensory channels (7, 8), and is prob- 
ably an effect identical with the “prior 


entry” of classical psychology. In the 
same way Conrad has not only shown 
complete failures to respond (17) but 
also shifts in the time at which responses 
appear (18). 

Having thus demolished the cynic, 
we must add that a slight complication 
should be added to the model in order 
to cover perfectly the results of experi- 
ments on successive response to simul- 
taneous stimuli. But this will be left 


2 At the risk of complicating the issue, we 
must say that “stimuli” in this sentence means 
“stimuli not of low information content.” A 
familiar predictable sequence of stimuli may 
quite well produce responses simultaneous with 
other responses (1). And the same S-R unit 
interferes less with another task when drawn 
from a smaller ensemble of possibilities (12). 
This point is of course implied in the state- 
ment that the perceptual mechanism has a 
limited capacity in the sense of information 
theory. 
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until the section on immediate memory; 
let us first deal with the effects of speed 
and of prolonged performance on the 
simple model. 

(g) As the stem of the Y is so nar- 
row, there will be a certain amount of 
delay between the insertion and emer- 
gence of each ball. At slow rates of in- 
sertion each ball will, however, emerge 
before the next is inserted. At faster 
rates there will develop a lag such that 
one ball may be inserted before the 
previous one emerges. With still faster 
rates balls will begin to accumulate in 
the branch; response will get further 
and further behind stimulus as the task 
proceeds. But this is a desperate ex- 
pedient and will lead to breakdown 
when the branch is full unless the rate 
slows down again. These stages have 
been shown by Vince (35) for visual 
stimuli; a related effect has also been 
shown in less detail in hearing (4). A 
result of the piling up of balls in the 
branch when the rate is too high is that 
a ball inserted at a very short interval 
after another will remain in the tube 
longer than is normal: the “psychologi- 
cal refractory period” (38). It must 
be remembered that balls are analogous 
to information and not to stimulation; 
highly probable stimuli will not give an 
unduly long reaction under these condi- 
tions (22). 

(4) Now suppose that we wish to op- 
erate the handle in such a way that one 
branch always has priority: prolonged 
performance of a task involving only 
one sensory channel. If there is a 
fairly rapid flow of balls down the se- 
lected branch, the handle will hardly 
need holding after the initial setting, 
since the beginning of any swing back 
will probably be checked by another 
ball. But if the selected branch is not 
very busy, the weight of the door must 
be held by a positive force on the 
handle. In this case the fingers holding 
the handle will fatigue; for purists, the 
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punched-tape machine will only hold one 
branch shut for a limited period. When 
the handle is released the door will 
swing back to close the opposite branch, 
and then return pendulum-wise to the 
desired position when it will again be 
held. The result will be that prolonged 
tasks in which unexpected stimuli ap- 
pear for less than a certain period (the 
“swings of the door”) will show marked 
decrements in performance. But simi- 
lar tasks in which the stimuli are pres- 
ent for longer times will show much less 
decrement. 

The earlier evidence for this view of 
prolonged performance has been sum- 
marized elsewhere (6). But earlier pres- 
entations have misled Deese (19) into 
supposing that this view implied a dec- 
rement with continued performance of 
any vigilance task. He employed tasks 
in which a signal was either presented 
repeatedly until seen, or else painted on 
a tube face by a sweep line which did 
not return to obliterate the signal for 
nearly three seconds. It was in fact 
predicted by the writer (6, p. 300) that 
such tasks would show little or no dec- 
rement, and this was found by Deese to 
be the case. The latter type of task 
gave more signs of decrement, probably 
being a borderline case, since the trace 
left on the phosphor screen decayed 
until scarcely visible at the end of the 
three seconds. It should be noted that 
an expectancy theory of the type fa- 
vored by Deese, while undoubtedly ap- 
plicable to some aspects of vigilance, is 
not able to account for the effects of 
varying the length of signal presenta- 
tion. In our Y-tube model, the role of 
expectancy is incorporated by describ- 
ing the balls as representing information 
rather than stimulation; the more prob- 
able signals receive more efficient re- 
sponse. But the swinging of the flap is 
also necessary. 

To summarize the writer’s present 
views on vigilance, the efficiency of a 


209 


man asked to detect infrequent signals 
should be described by both a mean 
and a variance. The mean level is de- 
termined by such factors as the rate at 
which signals arrive but not necessarily 
by the length of time since the session 
began. The variance, on the other hand, 
increases as the session progresses, short 
intervals of extremely low efficiency be- 
ing interspersed with fairly long periods 
of normal or supernormal efficiency. 
The score from any given task may de- 
pend on one or on the other of these 
quantities. Thus, for example, the ear- 
lier British work has been mostly con- 
cerned with the instants of very low 
efficiency; the type of results reported 
by Deese and by later British workers 
(6) mostly with the mean over ap- 
preciable periods of time. 

To return to our model, two further 
points should be made. The first is 
that the door need not swing the whole 
way over when the handle is released; 
it may reach a central position just as 
two balls arrive. There will then be a 
jam rather than passage of the wrong 
ball. Equally, failure of reaction to a 
task need not imply overt reaction to 
some irrelevant stimulus. 

Secondly, as the balls represent in- 
formation rather than stimulation, tasks 
in which the sequence of stimuli is pre- 
dictable will not show fatigue decre- 
ments of this type. To show decre- 
ments, the signals must be unpredictable 
either in content, as were those of Bills 
(3), whose “blocks” are instances of 
this effect, or else in time of occurrence, 
as were those of Mackworth (27); this, 
however, is only one of the necessary 
conditions. :! 


IMMEDIATE MEMORY 


Our Y tube does seem at this stage to 
have related a number of facts about 
perception and put them in a way which 


most people can understand. It is ad- 
mittedly ludicrous as a description of 
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what really happens in the brain, but 
this is a positive advantage. Psycholo- 
gists are not likely to mistake this model 
for speculative neurology, and so they 
should concentrate their experiments on 
the essentials of the theory rather than 
the irrelevant properties of the model. 
As a device for communicating the out- 
line of the theory, however, the model 
seems sufficiently adequate to justify an 
extra complication in order to express a 
theory of immediate memory. This the- 
ory is in a slightly different position 
from the views on perception which have 
been given so far; the latter are entailed 
by the experiments, but the theory of 
immediate memory is not the only con- 
ceivable explanation of the observed 
facts. Yet it has a fairly high probabil- 
ity and is worth discussing. 

The complication is twofold. First, a 
device must be supposed fitted to each 
branch of the Y tube, such that if any 
individual ball remains in the branch 
for more than a certain time continu- 
ously it is removed from the system 
completely. This could be done by fill- 
ing the tube with acid, but the writer 
does not wish to encourage the develop- 
ment of a race of fingerless psycholo- 
gists. Mechanical devices are quite pos- 
sible, though complicated to describe, 
and the details will therefore be left un- 
specified and available from the author. 
The second complication is that from 
the foot of the stem two return tubes 
lead back to the branches (Fig. 2). 
Admission to these return tubes is con- 
trolled by a lower door which again is 
operated by a handle. The latter is 
dependent on an outside operator, or 
the familiar punched-tape machine—in 
brief, some form of stored information. 
Finally, as the return tubes are operat- 
ing against gravity they must contain 
some form of conveyor, but this has no 
particular psychological significance. 

In immediate memory experiments, as 
usually performed, a stream of stimuli 
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Fic. 2. The model modified to illustrate the 
theory of immediate memory as a recurrent 
circuit; or, in other terms, as a fading trace 
periodically revived by rehearsal. 


is delivered completely before any overt 
response is required. Thus all the in- 
formation in the stream is within the 
organism at one time; all the balls are 
somewhere in the tubes. Yet with small 
numbers of balls the time between in- 
sertion and emergence can be varied 
without any apparent effect on efficiency 
(13). Let us consider some of the 
methods which may be adopted. 

(a) In the first place, let us suppose 
a short series of balls to be waiting in 
one of the branches. If they stay there 
indefinitely, the critical time will be ex- 
ceeded and the ball which has been 
there longest will leave the system. 
Thus the branch will not serve as a 
store during an unlimited period of de- 
lay before response is allowed. What 
is perfectly possible, however, is for the 
series of balls to pass down the stem 
and, by the lower door, back up a re- 
turn tube to the other branch. As soon 
as the full series has completed this 
round the process can be repeated, so 
that the set of balls can be kept in the 
system for any desired time. But note 
that insertion of any other ball during 
that time will either mean that the extra 
ball will never emerge, or else that the 
circulation of the series will meet inter- 
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ference. Note also that the system will 
only operate with a limited number of 
balls; above that number each ball will 
have to wait in the branch for more 
than the critical time, and there will 
therefore be a severe breakdown of stor- 
age. These are well-known character- 
istics of immediate memory. Less ob- 
vious is the suggestion that the interfer- 
ing effect of an extra ball during the 
delay will depend on the size of the 
series circulating, being most serious 
near the limit. This has been shown by 
Brown (14). He also demonstrated 
that the interval before or after the in- 
terpolated stimuli had little effect, as 
one would expect, and that if the extra 
stimuli came before the memory span 
stimulus there was little effect. It is 
Brown’s explanation of his results which 
has prompted the present account. 

(6) If both branches contain balls, 
movements of the door will put one 
group into circulation before the other; 
to move the door over while a group is 


actually passing would be to risk a jam. 
So all balls on one branch will pass 
down the stem before any on the other. 
As has been said, the memory analogy 
is in fact true, if a digit memory span is 
obtained with half the digits on one ear 
and half on the other, or half on the eye 


and half the ear (7, 8). In addition, 
the first branch to be dealt with under- 
goes less risk of any ball reaching the 
critical time; equally, performance is 
better on the first set of digits to be 
recalled (9, 13). 

(c) So far we have considered only 
the case in which the branches start 
with balls in them. But in practice 
these balls will be inserted into the 
branch one after another, and the extra 
time intervals thus produced will have 
their effect. Thus, for example, if balls 
are inserted rapidly into both branches 
simultaneously, the difficulty of moving 
the door will apply, as mentioned in the 
last paragraph; and so all the balls in 
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one branch will be dealt with before any 
in the other. But if there is a sizeable 
interval between successive balls in each 
branch, the door may be swung back 
and forth so as to deal with each branch 
alternately. A slow presentation of 
stimuli may equally allow them to be 
dealt with in the actual order of arrival 
rather than channel by channel (7, 8). 
It should be noted that the time allotted 
to “swings of the door,” as measured by 
these experiments, agrees with that de- 
termined by the experiments on pro- 
longed performance mentioned previ- 
ously (6). 

(d) Again, suppose a lengthy series 
of balls is inserted through one branch, 
the door being held open for them by 
the handle. If a couple of balls are 
meanwhile delivered to the other branch 
and wait there till the stem is clear for 
them to pass, there is naturally some 
risk that this extra pair of balls will ex- 
ceed the critical time. The risk will be 
greater if they are inserted with the ear- 
lier balls of the long series than if in- 
serted with the later balls. This also is 
true of immediate memory for spoken 
digits arriving at the two ears (9). A 
point of interest concerns the effect of 
“irrelevant” balls which arrive on the 
second branch during a long series on 
the first. If all the balls in the second 
branch are irrelevant, the door may be 
kept closed against them and they will 
eventually exceed the critical time and 
be removed (15). But if some relevant 
balls are also on the second branch, the 
door will have to be opened and the 
mixture of relevant and irrelevant balls 
passed down the stem. The former can 
be recirculated in the usual immediate 
memory fashion, and the latter removed 
by the lower door. The presence of 
irrelevant balls either before or after 
relevant ones will therefore produce a 
greater risk of jamming or of the criti- 
cal time being exceeded. This also is 
true of immediate memory (9). 
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(e) Now consider cases in which the 
order of the balls emerging at the bot- 
tom of the stem is different from that of 
their insertion. The extreme case is that 
of “backward memory span.” With our 
simple model, the easiest situation to 
consider is that in which the first half 
of a series of balls is to emerge after the 
second half. This can be arranged on 
the first circulation by passing the first 
balls down the stem and back up the 
return tube to the branch opposite to 
that in which they were inserted. The 
later balls are returned to the same 
branch which they entered, and on the 
second circulation are the first to be ad- 
mitted to the stem. After they have 
passed the flap the other balls can fol- 
low them. This will mean that the first 
balls have stayed in the branch longer 
than will normally be necessary for sim- 
ple recirculation, and so are more likely 
to exceed the critical time. Therefore 
such a rearrangement of order will re- 
duce the “memory span.”’ Furthermore 


it will alter the order of difficulty, since 
the first members of the series will suffer 


more than the later ones. These effects 
have been shown by Kay and Poulton 
(26), and are supported by Brown (13). 
The reduced memory span for a rear- 
ranged list is of course familiar from in- 
telligence testing, but the change in the 
serial-order effect is a more important 
deduction. 

(f) Serial-order effects will clearly de- 
pend very considerably on the rates of 
presentation of stimuli and of required 
response, as compared to the rate of re- 
circulation. They will also depend on 
the way in which stored information is 
used to operate the two handles—that 
is, on the strategy of the subject. They 
will certainly not be completely deter- 
mined by the primitive learning process, 
as is supposed by some existing theories. 
In fact it has been shown by Kay and 
Poulton (26) that the serial-order ef- 
fect is altered by the subject’s absence 
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of knowledge about the order in which 
recall will be required, even though the 
actual order remains the same as in a 
control experiment. (Knowledge of the 
amount to be recalled will also affect the 
efficiency of recall as well as the order, 
as has been shown by Brown [13]. 
This is because the uninformed subject 
must recirculate material which is not 
in fact to be recalled later; retention is 
an active process. ) 

Some general points about serial-order 
effect may be made, however. First, if 
the rate of response is below that of 
recirculation, but not so slow as to al- 
low complete recirculation between re- 
sponses, the first items in response will 
have stayed in store less long than the 
later items. So, as in (5) above, the 
earlier part of a list recalled in the or- 
der of presentation will be better re- 
called than the later part. Second, if 
the time taken to respond is eliminated 
—for instance, by requiring recall of 
only one item in the series—then the 
time taken to present the stimuli will be 
the chief factor influencing serial-order 
effects. The last stimuli to arrive will 
then be those which have been stored 
for the shortest time, and will be best 
recalled. This is the result of Gibson 
and Raffel (23). 

Third, a special case of some impor- 
tance arises when fresh material is be- 
ing presented while earlier material is 
being recalled. Considering the model, 
suppose a short series of balls are in- 
serted and recirculated to the branch 
opposite that by which they arrived. If 
they are now to be passed down the 
stem during the arrival of fresh balls on 
the original branch, the earliest of the 
second series of balls will suffer the 
longest delay. Consequently the end of 
the first series and the beginning of the 
second series will be the points of great- 
est difficulty. If the two series are con- 
sidered as one long one, the familiar 
U-shaped serial-order effect will appear. 
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This has been demonstrated by Poulton 
(32). In conventional learning experi- 
ments subjects are not instructed to re- 
hearse (recirculate) the earlier items 
during presentation of the later ones. 
But it seems plausible that, during the 
presentation of a long series at a me- 
dium rate, the presence of the earlier 
balls in the second branch should nor- 
mally encourage a tendency to recircu- 
late them while the later balls are still 
arriving. This would be comparable to 
the higher priority of a previously quiet 
channel, which was mentioned in the 
section on perception. It would result 
in the U-shaped curve of difficulty be- 
ing the usual one for serially presented 
material; but the curve will be subject 
to great modification by instructions. 
Furthermore a very slow rate of pres- 
entation, resulting from allowing recir- 
culation during gaps between presenta- 
tions, will minimize the effect. This is 
the case (24). 


LIMITATIONS OF THE MODEL 


Certain properties of the model are 


likely to be misleading. Of these the 
most important has been stated several 
times above, but is worth repeating. 
The balls represent information, not 
stimulation. The reader must not con- 
template the Y tube and decide that 
two stimuli cannot be dealt with simul- 
taneously. They can if they convey 
sufficiently little information. Clearly, 
many reflexes are compatible with one 
another, and it is likely that simple 
“voluntary” reactions are equally ca- 
pable of being carried on simultane- 
ously. It is only with unpracticed reac- 
tions involving a choice between several 
alternatives that we find an interfer- 
ence between two stimuli; but this is 
a very normal case outside the labora- 
tory. Theories such as that put for- 
ward by Welford (38) for the “refrac- 
tory period” undoubtedly need the 
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qualification that highly probable stimu- 
lus sequences may not show these ef- 
fects (22). But this does not disprove 
their general value. 

A related point is more serious. The 
length of the immediate memory span is 
roughly constant whatever the size of 
the ensemble from which the items are 
chosen (31): one cannot remember 
enough binary digits to make the in- 
formation in immediate memory equal 
to that stored in memorizing ordinary 
decimal digits (30). Yet the model 
makes the limit on memory span de- 
pendent on the time taken to pass the 
items through a limited capacity sys- 
tem, which will in turn depend on the 
information per item. Perhaps this diffi- 
culty may be resolved by suggesting 
that each possible item represents an 
extra branch on the stem, and an in- 
coming ball is always recirculated to its 
appropriate branch. The time taken to 
withdraw all the balls again would then 
depend on the time taken by the flap to 
operate, which is not dependent on the 
information per item. But this is clearly 
leading us into complications; for the 
present we may merely note that the 
point is an important one about im- 
mediate memory but not impossible to 
handle with a model of this type. A 
minor caution which should be added is 
that the model is deterministic while all 
the experimental results quoted are sta- 
tistical. 

Finally, we have put forward this 
model as one which may be described 
by an exactly worded theory which ap- 
plied also to man. The present paper 
is directed largely at those who find 
such a theory unintelligible in its origi- 
nal form, but it should be borne in mind 
that the theory under test is the abstract 
one given in the first section. Other- 
wise the error of identifying the model 
with the organism may be made, if only 
to discredit the <heory by its obvious 
absurdity. The formulation given ear- 
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lier is not complete, but it indicates the 
way in which the model and the man 
may be described by the same abstract 
theory. There should therefore be no 
excuse for treating the Y tube as any- 
thing more than an expository device 
and a mnemonic for recalling the results 
of numerous experiments. The writer’s 


freedom from such an error is demon- 
strated by the fact that he has never 
built his model in any physical sense. 


SUMMARY 


A mechanical model is described, to 
act as an easy introduction to a formal 
theory of attention and immediate mem- 
ory in information theory terms. A 
number of deductions from the theory 
which agree with experimental results on 
human beings are given as descriptions 
of the behavior of the model. 
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